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Document : 882263 

THE GENE CLUSTER INVOLVED IN SAFRACIN BIOSYNTHESIS 
AND ITS USES FOR GENETIC ENGINEERING 

FIELD OF THE INVENTION 

The present invention relates to the gene cluster responsible for the 
• biosynthesis of safracin, its uses for genetic engineering and new safracins 
obtained by manipulation of the biosynthesis mechanism. 

BACKGROUND OF THE INVENTION 

Safracins, a family of new compounds with a potent broad-spectrum 
antibacterial activity, were discovered in a culture broth of Pseudomonas 
sp. Safracin occurs in two Pseudomonas sp. strains, Pseudomonas 
fluorescens A2-2 isolated from a soil sample collected in Tagawagun, 
Fukuoka, Japan (Ikeda et al., 1983; WO 82 00146 and JP 58113192) and 
Pseudomonas fluorescens SC 12695 isolated from water samples taken 
from the Raritan-Delaware Canal, near New Jersey (Meyers et al., 1983). 
Safracins A and B, produced by Pseudomonas fluorescens A2-2, have been 
examined against different tumor cell lines and has been found to possess 
antitumor activity in addition to antibacterial activity. 




Safracins A R=H ET-743 
B R=OH 



Due to the structural similarities between safracin B and ET-743 
safracin offers the possibility of hemi-synthesis of the highly promising 
potent new antitumor agent ET-743, isolated from the marine tunicate 
Ecteinascidia turbinate, and which is currently in Phase II clinical trials in 
Europe and the United States. A hemisynthesis of ET-743 has been 
achieved starting from safracin B (Cuevas et al., 2000; WO 00 69862 and 
WO 01 87895). 

As an alternative of making safracins or its structural analogs by 
chemical synthesis, manipulating genes of governing secondary 
metabolism offer a promising alternative and allows for preparation of 
these compounds biosynthetically. Additionally, safracin structure offers 
exciting possibilities for combinatorial biosynthesis. 

In view of the complex structure of the safracins and the limitations 
in their obtention from Pseudomonas fluorescens A2-2, it would be highly 
desirable to understand the genetic basis of their synthesis in order to 
create the means to influence them in a targeted manner. This could 
increase the amounts of safracins being produced, because natural 
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production strains generally yield only low concentrations of the secondary 
metabolites that are of interest. It could also allow the production of 
safracins in hosts that otherwise do not produce these compounds. 
Additionally, the genetic manipulation could be used for combinatorial 
creation of novel safracin analogs that could exhibit improved properties 
and that could be used in the hemi-synthesis of new ecteinascidins 
compounds. 

However, the success of a biosynthetic approach depends critically 
on the availability of novel genetic systems and on genes encoding novel 
enzyme activities. Elucidation of the safracin gene cluster contributes to 
the general field of combinatorial biosynthesis by expanding the repertoire 
of genes uniquely associated with safracin biosynthesis, leading to the 
possibility of making novel precursors and safracins via combinatorial 
biosynthesis. 



SUMMARY OF THE INVENTION 

We have now been able to identify and clone the genes of safracin 
biosynthesis, providing the genetic basis for improving and manipulating 
in a targeted manner the productivity of Pseudomonas sp., and using 
genetic methods, for synthesising safracin analogues. Additionally, these 
genes encode enzyms that are involved in biosynthetic processes to 
produce structures, such as safracin precursors, that can form the basis 
of combinatorial chemistry to produce a wide variety of compounds. 
These compounds can be screened for a variety of bioactivities including 
anticancer activity. 

Therefore in a first aspect the present invention provides a nucleic 
acid, suitably an isolated nucleic acid, which includes a DNA sequence 



(including mutations or variants thereof), that encodes proteins that direct 
the biosynthesis of the non ribosomal peptide compounds, safracins. This 
invention provides a gene cluster, suitably an isolated gene cluster, with 
open reading frames encoding polypeptides to direct the assembly of a 
safracin molecule. 

One aspect of the present invention is a composition including at 
least one nucleic acid molecule, suitably an isolated nucleic acid molecule, 
that encodes at least one polypeptide that catalyses at least one step of the 
biosynthesis of safracins. Two or more such nucleic acid molecules can 
be present in the composition. DNA or corresponding RNA is also 
provided. 

In particular the present invention is directed to a nucleic acid 
molecule, suitably an isolated nucleic acid molecule, from a safracin gene 
cluster comprising said nucleic acid molecule, a portion or portions of said 
nucleic acid molecule wherein said portion or portions encode a 
polypeptide or polypeptides or a biologically active fragment of a 
polypeptide or polypeptides, a single-stranded nucleic acid molecule 
derived from said nucleic acid molecule, or a single stranded nucleic acid 
molecule derived from a portion or portions of said nucleic acid molecule, 
or a double-stranded nucleic acid molecule derived from the single- 
stranded nucleic acid molecule (such as cDNA from mRNA). The nucleic 
acid molecule can be DNA or RNA. 

More particularly, the present invention is directed to a nucleic acid 
molecule, suitably an isolated nucleic acid molecule, which includes or 
comprises at least SEQ ID 1, variants or portions thereof, or at least one of 
the sacA, sacB, sacC, sacC, sacD, sacE, sacF, sacG, sacH, sacH } sad, sacJ, 
orfl, orf2, orf3 or orf4 genes, including variants or portions. Portions can 
be at least 10, 15, 20, 25, 50, 100, 1000, 2500, 5000, 10000, 20000, 



25000 or more nucleotides in length. Typically the portions are in the 
range 100 to 5000, or 100 to 2500 nucleotides in length, and are 
biologically functional. 

Mutants or variants include polynucleotide molecules in which at 
least one nucleotide residue is altered, substituted, deleted or inserted. 
Multiple changes are possible, with a different nucleotide at 1, 2, 3 4 5 
10, 15, 25, 50, 100, 200, 500 or more positions. Degenerate variants are 
envisaged which encode the same polypeptide, as well as non-degenerate 
variants which encode a different polypeptide. The portion, mutant or 
variant nucleic acid molecule suitably encodes a polypeptide which retains 
a biological activity of the respective polypeptide encoded by any of the 
open reading frames of the safracin gene cluster. Allelic forms and 
polymorphisms are embraced. 

The invention is also directed to an isolated nucleic acid molecule 
capable of hybridizing under stringent conditions with a nucleic acid 
molecule of this invention. Particularly preferred is hybridisation with a 
translatable length of a nucleic acid molecule of this invention. 

The invention is also directed to a nucleic acid encoding a 
polypeptide which is at least 30%, preferably 50%, preferably 60%, more 
preferably 70%, in particular 80%, 90%, 95% or more identical in amino 
acid sequence to a polypeptide encoded by any of the safracin gene cluster 
open reading frames sacA to sacJ and orfl to orf4 (SEQ ID 1 and genes 
encoded in SEQ ID 1) or encoded by a variant or portion thereof. The 
polpyeptide suitably retains a biological activity of the respective 
polypeptide encoded by any of the safracin gene cluster open reading 
frames. 



In particular, the invention is directed to an isolated nucleic acid 



molecule encoding for any of SacA, SacB, SacC, SacD, SacE, SacF, SacG, 
SacH, Sad, SacJ, Orfl, Orf2, Orf3 or Orf4 proteins (SEQ ID 2-15), and 
variants, mutants or portions thereof. 

In one aspect, an isolated nucleic acid molecule of this invention 
encodes a peptide synthetase, a L-Tyr derivative hidroxylase, a L-Tyr 
derivative methylase, a L-Tyr O-methylase, a methyl-transferase or a 
monooxygenase or a safracin resistance protein. 

The invention also provides a hybridisation probe which is a nucleic 
acid molecule as defined above or a portion thereof. Probes suitably 
comprise a sequence of at least 5, 10, 15, 20, 25, 30, 40, 50, 60, or more 
nucleotide residues. Sequences with a length on the range 25 to 60 are 
preferred. The invention is also directed to the use of a probe as defined 
for the detection of a safracin or ecteinascidin gene. In particular, the 
probe is used for the detection of genes in Ecteinascidia turbinata. 

In a related aspect the invention is directed to a polypeptide encoded 
by a nucleic acid molecule as defined above. Full sequence, variant, 
mutant or fragment polypeptides are envisaged. 

In a further aspect the invention is directed to a vector, perferably an 
expression vector, preferably a cosmid, comprising a nucleic acid molecule 
encoding a protein or biologically active fragment of a protein, wherein said 
nucleic acid is as defined above. 

In another aspect the invention is directed to a host cell transformed 
with one or more of the nucleic acid molecules as defined above, or a 
vector, an expression vector or cosmid as defined above. A preferred host 
cell is transformed with an exogenous nucleic acid comprising a gene 
cluster encoding polypeptides sufficient to direct the assembly of a safracin 



or safracin analog. Preferably the host cell is a microorganism, more 
preferably a bacteria. 



The invention is also directed to a recombinant bacterial host cell in 
which at least a portion of a nucleic acid molecule as denned above is 
disrupted to result in a recombinant host cell that produces altered levels 
of safracin compound or safracin analogue, relative to a corresponding 
nonrecombinant bacterial host cell. 



The invention is also directed a method of producing a safracin 
compound or safracin analogue comprising fermenting, under conditions 
and in a medium suitable for producing such a compound or analogue, an 
organism such as Pseudomonas sp, in which the copy number of the 
safracin genes/cluster encoding polypeptides sufficient to direct the 
assembly of a safracin or safracin analog has been increased. 

The invention is also directed a method of producing a safracin 
compound or analogue comprising fermenting, under conditions and in a 
medium suitable for producing such compound or analogue, an organism 
such as Pseudomonas sp in which expression of the genes encoding 
polypeptides sufficient to direct the assembly of a safracin or safracin 
analogue has been modulated by manipulation or replacement of one or 
more genes or sequence responsible for regulating such expression. 
Preferably expression of the genes is enhanced. 

The invention is also directed to the use of a composition including 
at least one isolated nucleic acid molecule as defined above or a 
modification thereof for the combinatorial biosynthesis of non-ribosomal 
peptides, diketopiperazine rings and safracins. 

In particular the method involves contacting a compound that is a 
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substrate for a polypeptide encoded by one or more of the safracin 
biosynthesis gene cluster open reading frames as defined above with the 
polypeptide encoded by one or more safracin biosynthesis gene cluster 
open reading frames, whereby the polypeptide chemically modifies the 
compound. 

In still another embodiment, this invention provides a method of 
producing a safracin or safracin analog. The method involves providing a 
microorganism transformed with an exogenous nucleic acid comprising a 
safracin gene cluster encoding polypeptides sufficient to direct the 
assembly of said safracin or safracin analog; culturing the bacteria under 
conditions permitting the biosynthesis of safracin or safracin analog; and 
isolating said safracin or safracin analog from said cell. 

The invention is also directed to any of the precursor compounds P2, 
P14, analogs and derivatives therof and their use in the combinatorial 
biosynthesis non-ribosomal peptides, diketopiperazine rings and safracins. 

Additionally, the invention is also directed to the new safracins 
obtained by knock out safracin P19B, safracin P22A, safracin P22B, 
safracin D and safracin E, and their use as antimicrobial or antitumor 
agents, as well as their use in the synthesis of ecteinascidin compounds. 

The invention is also directed to new safracins obtained by directed 
biosynthesis as defined above, and their use as antimicrobial or antitumor 
agents, as well as their use in the synthesis of ecteinascidin compounds. 
In particular the invention is directed to safracin B-ethoxy and safracin A- 
ethoxy and their use. 

In one aspect, the present invention enables the preparation of 
structures related to safracins and ecteinascidins which cannot or are 
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difficult to prepare by chemical synthesis. Another aspect is to use the 
knowledge to gain access to the biosynthesis of ecteinascidins 
Ecteinascidia turbinata, for example using these sequences or parts 
probes in this organism or a putative symbiont. 



m 

as 



More fundamentally, the invention opens a broad field and gives 
access to ecteinascidins by genetic engineering. 



BRIEF DESCRIPTION OP THE FIGURES 

Fig. 1: Structural organization of the chromosomal DNA region cloned in 
pL30p cosmid. The region of P. fluoresces A2-2 DNA, containing the 
safracin gene cluster, is shown. Both, sacABCDEFGH anf sacIJ, gene 
operons and the modular organization of the peptide synthetases deduced 

from sacA, sacB and sacC are illustrated. The following domains are 

indicated: C: condensation; T: thiolation; A: adenylation and Re: reductase. 

Location of other genes present in pL30p cosmid (orfl to orf4) as well as 

their proposed fuction is shown. 

Fig. 2: Conserved core motifs between NRPSs. Conserved amino acid 
sequences in SacA, SacB and SacC proteins and their comparison with its 
homologous sequences from Myxococcus xanthus DM50415. 

Figure 3. NRPS biosynthesis mechanism proposed for the formation of the 
Ala-Gly dipeptide. Step a*, adenylation of Ala; b*, transfer to the 4'- 
phosphopantetheinyl arm; c*, transfer to the waiting/elongation site; d* 
adenylation of the Gly; e*, transfer to the 4'-phosphopantetheinyl arm- f*' 
condensation of the elongation chain on the 4--phosphopantetheinyl arm 
wxth the starter chain at the waiting/ elongation site; g*, Ala-Gly dipeptide 
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attached to the phosphopantetheinyl arm of SacA and h*, transfer of the 
elongated chain to the following waiting/elongation site. 

Fig. 4: Cross-feeding experiments. A. Scheme of A2-2 DNA fragments 
cloned in pBBRl-MCS2 vector and products obtained in the heterologous 
host. B. HPLC profile of safracin production in wild type strain versus sacF 
mutant. The addition of P2 precursor to the sacF mutant, provided both in 
trans and synthetically, yield safracin B production. SfcA, safracin A and 
SfcB, safracin B. 

Fig. 5: Scheme of the safracin biosynthesis mechanism and biosynthetic 
intermediates. Single enzymatic steps are indicated by a continuous arrow 
. and multiple reactions steps are indicated by discontinuous arrows. 

Fig. 6: Safracin gene disruptions and compounds produced. A. Gene 
disruption and precursor molecules synthesized by the mutants 
constructed. Gene marked with an asterisk does not belong to the safracin 
cluster. Inactivation of genes or/2, orf3 and orf4 is in progress. B. HPLC 
profile of safracin production in wild type strain and in sacA, sad and sacJ 
mutants. Structure of the different molecules obtained is shown. 

Fig. 7: Structure of the different molecules obtained by gene disruption. 
Inactivation of SacJ protein (a*) yields P22B, P22A and PI 9 molecules, 
whereas gene disruption of sacl (b*), produces only P19 compound. The 
sad disruption, together with the sacJ reconstructed expression, produces 
two new safracins: safracin D (possible precursor for ET-729 hemi- 
synthesis) and safracin E (c*). 

Fig. 8: Addition of specific designed ^unnatural' precursors (P3). Chemical 
structure of the two molecules obtained by addition of P3 compound to the 
sacF mutant. 
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Fig. 9: Scheme of the gene disruption event through simple recombination, 
using an homologous DNA fragment cloned into pK18:MOB (an integrative 
plasmid in Pseudomonas) . 



DETAILED DESCRIPTION OP THE INVENTION 

Non ribosomal peptide synthetases (NRPS) are enzymes responsible 
for the biosynthesis of a family of compounds that include a large number 
of structurally and functionally diverse natural products. For example, 
peptides with biological activities provide the structural backbone for 
compounds that exhibit a variety of biological activities such as, 
antibiotics, antiviral, antitumor, and immunosuppressive agents (Zuber et 
al., 1997; Marahiel et al., 1997). 

Although structurally diverse, most of these biologically active 
peptides share a common mechanistic scheme of biosynthesis. According 
to this model, peptide bond formation takes place on multienzymes 
designated peptides synthetases, on which amino acid substrates are 
activated by ATP hydrolysis to the corresponding adenylate. This unstable 
intermediate is subsequently transferred to another site of the 
multienzyme where it is bound as a thioester to the cysteamine group of 
an enzyme-bound 4--phosphopantetheninyl (4'-PP) cofactor. At this stage, 
the thiol-activated substrates can undergo modifications such as 
epimerisation or N-methylation. Thioesterified substrate amino acids are 
then integrated into the peptide product through a step-by-step elongation 
by a series of transpeptidation reactions. With this template arrangement 
in peptide synthetases, the modules seem to operate independently of one 
another, but they act in concert to catalyse the formation of successive 
peptide bonds (Stachelhaus et al., 1995; Stachelhaus et al., 1996). The 
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general scheme for non-ribosomal peptide biosynthesis has been widely 
reviewed (Marahiel et al., 1997; Konz and Marahiel, 1999; Moffit and 
Neilan, 2000). 

A large number of bacterial operons and fungal genes encoding 
peptide synthetases have recently been cloned, sequenced and partially 
characterized, providing valuables insights into their molecule architecture 
(Marahiel, 1997). Different cloning strategies were used, including probing 
of expression libraries by antibodies raised against peptide synthetases, 
complementation of deficient mutants, and the use of designed 
oligonucleotides derived from amino acid sequences of peptide synthetase 
fragments. 

Analysis of the primary structure of these genes revealed the 
presence of distinct homologous domains of about 600 amino acids. This 
specific functional domains consist of at least six highly conserved core 
sequences of about three to eight amino acids in length, whose order and 
location within all known domains are very similar (Kusard and Marahiel, 
1994). The used of degenerated oligonucleotides derived from the 
conserved cores opens the possibility of identifying and cloning peptide 
synthetases from genomic DNA, by using the polymerase chain reaction 
(PCR) technology (Kusard and Marahiel, 1994; Borchert et al., 1992). 

The structure of safracin suggests that this compound is synthesized 
by a NRPS mechanism. The cloning and expression of the non-ribosomal 
peptide synthetases and the associated tailoring enzymes from 
Pseudomonas fluoresceins A2-2 safracin cluster would allow production of 
unlimited amounts of safracin. In addition, the cloned genes could be used 
for combinatorial creation of novel safracin analogs that could exhibit 
improved properties and that could be used in the hemi- synthesis of new 
ecteinascidins. Moreover, cloning and expressing the safracin gene cluster 
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in heterologous systems or the combination of safracin gene cluster with 
other NRPS genes could result in the creation of novel drugs with improved 



activities. 



The present invention provides, in particular, the DNA sequence 
encoding NRPS responsible for biosynthesis of safracin, i.e., safracin 
synthetases. We have characterized a 26,705 bp region (SEQ ID NO:l) 
from Pseudomonas fluorescens A2-2 genome, cloned in pL30P cosmid and 
demonstrated, by knockout experiments and heterologous expression, that 
this region is responsible for the safracin biosynthesis. We expressed the 
PL30P cosmid in two strains of Pseudomonas sp., which do not produce 
safracin, and the result was a production of safracin A and B at levels of a 
22%, for P. fluorescens (CECT 378), and 2%, for P. aeruginosa (CECT 110), 
in comparison with P. fluorescens A2-2 production. The predicted amino 
acids sequences of the various peptides encoded by this DNA sequence is 
shown in SEQ ID NO:2 through SEQ ID NO:15 respectively. 

The gene cluster for safracin biosynthesis derived from P. fluorescens 
A2-2, is characterized by the presence of several open reading frames 
(ORF) that are organized in two divergent operons (Fig. 1), an eight genes 
operon (sacABCDEFGH> and a two genes operon (sacIJ), preceded by well- 
conserved putative promoters regions that overlap. The safracin 
biosynthesis gene cluster is present in only one copy in P. fluorescens A2-2 
genome. 

Our results indicate that the eight genes operon would be 
responsible for the safracin skeleton biosynthesis and the two genes 
operon would be responsible for the final tailoring of safracins. 

In the sacABCDBFGH operon, the deduced amino acid sequences 
encoded by sacA, sacB and sacC strongly resemble gene products of 
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NRPSs. Within the deduced amino acid sequences of SacA, SacB and 
SacC, one peptide synthetase module was identified on each of the ORFs. 

The first surprising feature of the safracin NRPS proteins is that from 
the known active sites and core regions of peptide synthetases (Konz and 
Marahiel, 1999), the first core is poorly conserved in all three peptide 
synthetases, SacA, SacB and SacC (Fig. 2). The other five core regions are 
well conserved in the three safracin NRPSs genes. The biological 
significance of the first core (LKAGA) is unknown, but the SGT(ST)TGxPKG 
(Gocht and Marahiel, 1994; Konz and Marahiel, 1999), the TGD (Gocht 
and Marahiel, 1994; Konz and Marahiel, 1999) and the KIRGxRIEL 
(Pavela-Vrancic et al., 1994; Konz and Marahiel, 1999) core sequences 
could be assigned to ATP binding and hydrolysis. The serine residue of the 
core sequence LGGxS could be shown to be the site of thioester formation 
(D'Souza et al., 1993; Vollenbroich et al., 1993; Konz and Marahiel, 1999) 
and 4'-phosphopantetheine binding (Stein et al., 1994; Konz and Marahiel, 
1999). These findings, together with the fact that safracin seems to be 
synthesized from amino acids, supports the hypothesis that non-ribosomal 
peptide bond formation via the thiotemplate mechanism is involved in the 
biosynthetic pathway of safracin and that sacA, sacB and sacC encode the 
corresponding peptide synthetases. According to this mechanism, amino 
acids are activated as aminoacyl-adenylates by ATP hydrolysis and 
subsequently covalently bound to the enzyme via carboxyl-thioester 
linkages. Then, in further steps, transpeptidation and peptide bond 
formation occurs. 

Secondly, it is striking that our sequence data clearly shows that the 
colinearity rule, according to which the order of the amino acid binding 
modules along the chromosome parallels the order of the amino acids in 
the peptide, does not hold for the safracin synthetase system. According to 
the sequence database homologies and safracin and saframycin structures 



15 



homologies, SacA would be responsible for the recognition and activation 
of the Gly residue and SacB and SacC would be responsible for the 
recognition and activation of the two L-Tyr derivatives that are 
incorporated into the safracin skeleton, while the putative Ala-NRPS gene 
would be missing in the safracin gene cluster. In a few nonribosomal 
peptide synthetases gene clusters, such as in the pristamycin (Crecy- 
Lagard et al, 1997) and in the phosphinothricin tripeptide (Schwartz et al., 
1996) biosynthesis pathways, the first NRPS is not juxtaposed with the" 
second NRPS gene. In concrete, in the pristamycin biosynthetic pathway 
the first structural gene (snbA) and the second structural gene (snbQ are 
130kb apart. This is not the case for the safracin gene cluster where the 
results of the heterologous expression with the pL30P cosmid clearly 
demonstrates that there is no NRPS gene missing since there is 
heterologous safracin production. 

Thirdly, even though the question about the mechanism by which the 
dipeptide Ala-Gly is formed remains open, the presence in sacA of an extra 
C domain at the amino terminus of the first NRPS gene, suggests the 
possibility of a bifuntional adenylation activation activity by this gene. We 
propose that the Ala would be first charged on the phosphopantetheinyl 
arm of SacA (Fig. 3 a* and b*) before being transferred to a waiting 
position, a condensation domain, located in N-terminal of sacA (Pig. 3, c *). 
The Gly adenylate would then be charged on the s'ame 
phosphopantetheinyl arm (Pig. 3, d* and e*), positioned to the elongation 
Site, and elongation would occur (Pig. 3, £*). The arm of the first module 
would at this stage be charged whith a Ala-Gly dipeptide (Pig. 3, g*). We 
proposed that the dipeptide wpuld then be transferred on a waiting 
position in the second phosphopantetheinyl arm (Fig. 3, h*), located in 
SacB, to continue the synthesis of the safracin tetrapeptide basic skeleton. 
An alternative biosynthesis mechanism could be the direct incorporation of 
a dipeptide Ala-Gly into SacA. In this case, the dipeptide could be 
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originated from the activity of highly active peptidyl transferase ribozyme 
family (Sun et al, 2002) or from the activity of bacterial proteolisis. 

And fourthly, although in most of the prokaryotic peptide synthetases 
the thioesterase moiety, which appears to be responsible for the release of 
the mature peptide chain from the enzyme, is fused to the C-terminal end 
of the last amino acid binding module (Marahiel et al., 1997), in the case of 
safracin synthetases, the TE domain is missing. Probably, in the safracin 
synthesis after the last elongation step, the tetrapeptide could be released 
by an alternative strategy for peptide-chain termination that also occurs in 
the saframycin synthesis (Pospiech et al., 1996). This particular 
termination strategy is catalysed by a reductase domain at the carboxy- 
terminal end of the SacC peptide synthetase which catalyses the reductive 
cleavage of the associated T-domain-tethered acyl group, releasing a linear 
aldehyde. 

Our cross feeding experiments indicate that the last two amino acids 
incorporated into the safracin molecule are two L-Tyr derivatives called P2 
(3-hydroxy-5-methyl-0-methyltyrosine) (Pig. 4, 5), instead of two L-Tyr as 
it is proposed to occur in saframycin synthesis. First, the products of two 
genes {sacF and sacG), similar to bacterial methyltransferases, have shown 
to be involved in the O, C-methylation of L-Tyr to produce PI 4 (3-methyl- 
O-methyltyrosine) , precursor of P2. A possible mechanism could envisage 
that the O-methylation occurs first and then the C-methylation of the 
amino acid derivative is produced. Secondly, P2, the substrate for the 
peptide synthetases SacB and SacC, is formed by the hydroxylation of P14 
by SacD (Fig. 4, 5). 



OMe 





NH 2 



NH 2 



CnH^NOa 
Exact Mass: 209,1 1 
Mol. Wt.: 209,24 
C. 63,14; H, 7,23; N, 6,69; O, 22,94 



C 11 H 15 N0 4 
Exact Mass: 225,10 
Mol. Wt.: 225,24 
C, 58,66; H, 6,71; N, 6,22; O, 28,41 
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P-14 p_ 2 



Apart from the safracin biosynthethic genes, in the sacABCDEFGH 
operon there are also found two genes, sacB and sacH, involved in an 
unknown function and in the safracin resistance mechanism, respectively. 
We have demonstrated that sacH gene codes for a protein that when is 
heterologous expressed, in different Pseudomonas strains, a highly 
increase of the safracin B resistance is produced. SacH is a putative 
transmembrane protein, that transforms the C21-OH group of safracin B 
into a C 2 i-H group, to produce safracin A, a compound with less antibiotic 
and antitumoral activity. Finally, even though still is unknown about the 
putative function of SacE, homologous of this gene have been found close 
to various secondary metabolites biosynthetic gene clusters in some 
microorganisms genomes, suggesting a conserved function of this genes in 
secondary metabolite formation or regulation. 

In the sac/J operon, the deduced amino acid sequences encoded by sad 
and sacJ strongly resemble gene products of methyltransferase and 
hydroxylase/monoxygenase, respectively. Our data reveals that Sad is the 
enzyme responsible for the iV-methylation present in the safracin 
structure, and that SacJ is the protein which makes an additional 
hydroxylation on one of the L-Tyr derivative incorporated into the 
tetrapeptide to produce the quinone structure present in all safracin 
molecules. JV-Methylation is one of the modifications of nonribosomally 
synthesized peptides that significantly contributes to their biological 
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activity. Except for saframycin (Pospiech et aL, 1996), that is produced by 
bacteria and is i\T-methylated, all the JV-methylated nonribosomal peptides 
known are produced by fungi or actinomycetes and, in most of the cases, 
the responsible for the iV-methylation is a domain which reside in the 
nonribosomal peptide synthetase. 



Table I. Sumary of the safracin biosynthetic genes and its predicted 
functions. 



ORF 
name 


Protein name 


Proposed function 


Amino acids 


Molecular 
weight 


sacA 


SacA 


Peptide synthetase 


1004 


110,45 


sacB 


SacB 


Peptide synthetase 


1063 


117,5 


sacC 


SacC 


Peptide synthetase 


1432 


157,3 


sacD 


SacD 


L-Tyr derivative 


350 


39,18 






hidroxylase 






sacB 


SacE 


Unknown 


61 


6,73 


sacF 


SacF 


L-Tyr derivative 


355 


39,8 






methylase 






sacG 


SacG 


L-Tyr O-methylase 


347 


38,3 


sacH 


SacH 


Resistance 


180 


19,63 


sad 


Sad 


methyl-transferase 


220 


24,22 


sacJ 


SacJ 


monooxygenase 


509 


55,34 



The safracin putative synthetic pathway, with indications of the specific 
amino acid substrates used for each condensation reaction and the 
various post-condensation activities, is shown in Fig. 5* 



To further evaluate the role of safracin biosynthetic genes, we 
constructed knock out mutants of each of the genes of the safracin cluster 
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(Fig. 6). The disruption of the NRPSs genes (sacA, sacB and sacC ) as well 
as sacD, sacF and sacG, resulted in safracin and P2 non producing 
mutants. Our results indicate that the genes from sacA to sacHare part of 
the same genetic operon. As a consequence of the sad and saoj gene 
disruptions three new molecules were originated, P19B, P22A and P22B 
(Fig. 6). 




The production of P22A and P22B (Fig. 7a*) by sacJ mutant 
demonstrated that the role of the SacJ protein is to produce the additional 
hydroxylation of the left L-Tyr derivatives amino acid of the safracin, the 
one involved in the quinone ring. The production of P19B (Fig. 7b*) by sad 
mutant, a safracin like molecule where the JV-methylation and the quinone 
ring are missing, confirms that Sad is the N-methyltransferase enzyme 
and suggests that sacIJ is a transcriptional operon. The production of 
P19B also by sacJ mutant (Pig. 7a*) suggests that probably the N- 
methylation occurs after the quinone ring has been formed. Even though 
these new structures have no interesting antimicrobial activity on B. 
subtilis or no high citotoxic activity on cancer cells, they can serve as 
interesting new precursors for the hemisynthesis of new active molecules. 
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As far as structure activity is concerned, the observation that P19B, P22A 
and P22B appear to loose their activity, suggests that the lost of the 
quinone ring from the safracin structure is directly related with the lost of 
activity of the safracin family molecules. 

The disruption of sad gene with the reconstitution of the sacJ gene 
expression resulted in the production of two new safracins. The two 
antibiotics produced, at levels of production as high as the levels of 
safracin A/ safracin B production in the wild type strain, have been named 
as safracin D and safracin E (Fig* 7c*). 



The safracin D and safracin E are safracin B and safracin A like 
moledules, respectively, where the N-methylation is missing. Both, safracin 
D and safracin E have been shown to possess the same antibacterial and 
antitumoral activities as safracin B and safracin A, respectively. Apart 
from its high activities properties, antibacterial and antitumoral, safracin 
D could be used in the hemi-synthesis of the ecteinascidin ET-729, a 
potent antitumoral agent, as well as in the hemi-synthesis of new 
ecteinascidins. 

A question arises concerning the role of the aminopeptidase-like protein 
coded by a gene located at 3'site of the safracin operon. The insertional 
inactivation of orfl (PM-S1-14) showed no effect on safracin A/ safracin B 
production. Because of its functionality properties it remains unclear if 
this protein could play some role in the safracin metabolism. The other 
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21 

genes present in the pL30P cosmid (orf2 to orf4) will have to be studied in 
more detail. 

Another aspect of the invention is that provides the tools necessary for 
the production of new specific designed "unnatural" molecules. The 
addition of a specific modified P2 derivative precursor named P3, a 3- 
hydroxy-5-methyl-O-methyltyrosine, to the sacE mutant yields two 
"unnatural" safracins that incorporated this specific modified precursor, 
safracin A(OEt) and safracin B(OEt) (Pig. 8). 




C3oH4 0 N40 6 

Exact Mass: 552,29 
Mol. Wt: 552,66 
C, 65,20; H, 7,30; N, 10,14; O, 17,37 




C30H40N4O7 
Exact Mass: 568,29 
Mol. Wt: 568,66 
C, 63.36; H, 7,09; N, 9,85; 0, 19,< 



safracin A(OEt) 



safracin B(OEt) 



The two new safracins are potent antibiotic and antitumoral 
compounds. The biological activities of safracin A(OEt) and Safracin B(OEt) 
are as potent as the activities of safracin A and safracin B, respectively. 
These new safracins could be the source for new potent antitumoral 
agents, as well as a source of molecules for the hemi-synthesis of new 
ecteinascidins. 



In addition, the genes involved in safracin synthesis could be combined 
with other non ribosomal peptide synthetases genes to result in the 
creation of novel "unnatural" drugs and analogs with improved activities. 
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EXAMPLES 

Example 1: Extraction of nucleic acid molecules from Pseudomonas 
fluorescens A2-2 

Bacterial strains 

Strains of Pseudomonas sp. were grown at 27°C in Luria-Bertani (LB) 
broth (Ausubel et at, 1995). E. coli strains were grown at 37°C in LB 
medium. Antibiotics were used at the following concentrations: ampicillim 
(50 ng/ml), tetracycline (20 ng/ml) and kanamycin (50 ^ig/ml). 



Table II. Strains used in this invention. 



Code 


Genotype 


PM-S 1-001 


P. fluorescens A2-2 wild type 


PM-S 1-002 


sacA- 


PM-S1-003 


sacB- 


PM-S 1-004 


sacC- 


PM-S 1-005 


sacJ- 


PM-S 1-006 


sad- 


PM-S 1-007 


sad- with sacJ expression reconstitution 


PM-S 1-008 


sacF- 


PM-S 1-009 


sacG- 


PM-S1- 010 


sacl> 


PM-S1- 014 


orfl- 


PM-S1- 015 


A2-2 + pLAFR3 


PM-S1- 016 


A2-2 + pL30p 


PM-19- 001 


P. fluorescens CECT378 + pLAFR3 


PM-19- 002 


P. fluorescens CECT378 + pL30p 


PM-19- 003 


P. fluorescens CECT378 + pBBRl-MCS2 


PM-19- 004 


P. fluorescens CECT378 + pB5H83 



e 



PM-16- 001 
PM-16- 002 
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PM- 19- 005 P. fluorescens CECT378 + pB7983 ' 
PM- 19- 006 P. fluorescens CECT378 + pBHPT3 



P. aeruginosa CECT1 10 + pLAFR3 
P. aeruginosa CECT1 10 + pL30p 



PM- 17- 003 P. putida ATCC12633+ pBBRl-MCS2 

PM- 1 7- 004 P. putida ATCC 1 2633+ pB5H83 

PM-17- 005 P. putida ATCC12633+ pB7983 

PM- 18- 003 P. stutzeri ATCC17588+ pBBRl-MCS2 

PM-18- 004 P. stutzeri ATCC17588+ pB5H83 

PM- 1 8- 005 P. stutzeri ATCC 1 7588+ pB7983 



DNA manipulation 

Unless otherwise noted, standard molecular biology techniques for in 
vitro DNA manipulations and cloning were used (Sambrook et aL, 1989). 

DNA extraction 

Total DNA from Pseudomonas fluorescens A2-2 cultures was 
prepared as reported (Sambrook et al., 1989). 

Computer analysis 

Sequence data were compiled and analysed using DNA-Star software 
package. 

Example 2; Identification of NRPS p e nes resp on sible for saTr*^ 
production in Ps eudomonas fluorescens A2-2. 

Primer design 

Marahiel et al. (Marahiel et al. 1997) previously reported highly 
conserved core motifs of the catalytic domains of cyclic and branched 
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peptide synthetases. Based on multiple sequence alignments of several 
reported peptide synthetases the conserved regions A2, A3, A5, A6, A7 and 
A8 of adenylation and T of thiolation modules were targeted for the 
degenerate primer design (Turgay and Marahiel, 1994). The wobble 
positions were designed in respect to codon preferences within the selected 
modules and the expected high G/C content of Pseudomonas sp. All 
oligonucleotides were obtained from ISOGEN (Bioscience BV). A PCR 
fragment was obtained when degenerate oligonucleotides derived from the 
YGPTE (A5 core) and LGGXS (T core) sequences were used. These 
oligonucleotides were denoted PS34-YG and PS6-FF, respectively. 



Table III. PCR primers designed for this study. 



Primer designation 
and 

orientation 


Sequence 


Length 


PS34-YG (forward) 


5'- TAYGGNCCNACNGA -3' 


14-rner 


PS6-FF (reverse) 


5 -TSNCCNCCNADNTCRAARAA-3 9 


20-mer 



PCR conditions for amplification of DNA from P. fluorescens A2-2 

A fragment internal to nonribosomal peptide synthetases (NRPS) was 
amplified using PS-34-YG and PS6-FF oligonucleotides and P. fluorescens 
A2-2 chromosomal DNA as template. Reaction buffer and Taq polymerase 
from Promega were used. The cycling profile performed in a Personal 
thermocycler (Eppendorf) consists on: 30 cycles of 1 min at 95°C, 1 min at 
50°C, 2 min at 72°C. PCR products were on the expected size (750 bp 
aprox.) based on the location of the primers within the NRPS domains of 
other synthetase genes. 
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DNA cloning 

PCR amplification fragments were cloned into pGEM-Teasy vector 
according to the manufacturer (Qiagen, Inc., Valencia, CA). In this way, 
cloned fragments are flanked by two EcoRI restriction sites, in order to 
facilitate subsequent subclonig in other plasmids (see below). Since NRPSs 
enaymes are modular, clones from the degenerated PCR primers 
represents a pool of fragments from different domains. 

DNA sequencing 

All sequencing was performed using primers directed against the 
cloning vector, with an ABI Automated sequencer (Perkin-Elmer). Cloned 
DNA sequences were identified using the BLAST server of the National 
Center for Biotechnology Information accessed over the Internet (Altschul 
et al., 1997). All of the sequences have signature regions for NRPSs and 
show high similarity in BLAST searches to bacterial NRPS showing that 
they are in fact of peptide origin. Moreover, a probable domain similarity 
search was performed using the PROSITE (European Molecular Biology 
Laboratory, Heidelberg, Germany) web server. 

Gene disruption of Pseudomonas fluorescens A2-2 

In order to analyse the function of the genes cloned, these genes 
were disrupted through homologous recombination (Fig. 9). For this 
purpose, recombinant plasmids (pG-PS derivatives) harboring the NRPS 
gene fragment were digested with EcoM restriction enzyme. The resulting 
fragments belonging to the gene to be mutated were cloned into the 
pKlSmob mobilizable plasmid (Schafer et al, 1994), a chromosomal 
integrative plasmid able to replicate in B. coli but not in Pseudomonas 
strains. Recombinant plasmids were introduced first in B. coli S17-XPIR 
strain by transformation and then in P. fluorescens A2-2 through 
biparental conjugation (Herrero et al, 1990). Different dilutions of the 
conjugation were plated onto LB solid medium containing ampicillim plus 
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kanamycin and incubated overnight at 27°C. Kanamycin-resistant 
transconjugants, containing plasmids integrated into the genome via 
homologous recombination, were selected. 

Biological assay (biotest) for safracin production 

Strains P. fluorescens A2-2 and its derivatives were incubated in 50 
ml baffled erlenmeyer flasks containing fermentation medium with the 
corresponding antibiotics. Initially, SA3 fermentation medium was used 
(Ikeda Y., 1985). In order to increase the productivity of the fermentation 
process statistical-mathematical methods like Hackett-Burman designed 
was used to select nutrients and response surface optimization techniques 
were tested (Hendrix C, 1980) in order to determine the optimum level of 
each key independent variable. Experiments to improve the culture 
conditions like incubation temperature and agitation have also been done. 
Finally a highly safracin B producer medium named 16B (152 g/1 of 
mannitol, 35g/l of G20-25 yeast, 26 g/1 of CaCOa , 14 g/1 of amonium 
sulfate, 0.18 g/1 of cloruro ferrico, pH 6.5) was selected. 

The safracin production was assay testing the capacity of inhibition 
a Bacillus subtilis solid culture by 10 ^1 of the supernatant of a 3 days 
Pseudomonas sp. culture incubated at 27°C (Alijah et al., 1991). R 
fluorescens A2-2 cultures produce inhibition zones of 10-14 mm diameter 
while non-producing mutants did not inhibit B. subtilis growth. Three 
isolated clones had the safracin biosynthetic pathway affected. In order to 
confirm the results, HPLC analysis of safracin production was performed. 

HPLC analysis of safracin production. 

The supernatant was analysed by using HPLC Symmetry C-18. 
300A, 5 urn , 250 x 4.6 mm column (Waters) with guard-column 
(Symmetry C-18, 5 M m 3.9 x 20 mm, Waters). An ammonium acetate buffer 
(10 mM, 1% Diethanolamine, pH 4.0)- acetonitrile gradient was the mobile 
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phase. Safracin was detected by absorption at 268 nm. In Pig. 6, HPLC 
profile of safracin and safracin precursors produce by P. fluoresceins A2-2 
strain and different safracin-like structures produced by P. fluorescens 
mutants are shown. 



Example 3. Cloning and seq u ence analysis of safracin cluster 

Inverse PCR and phage library hybridization 

Southern hybridization' on mutant chromosomal DNAs verified the 
correct gene disruption and demonstrated that the peptide synthethase 
fragment cloned into pK18mob plasmid was essential for the production of 
safracin. Analysis of the non safracin producers mutants obtained 
demonstrated that all of them had a gene disruption into the same gene, 
sacA. 

Inverse PCR from genomic DNA and screening of a phage library of 
P. fluorescens A2-2 genomic DNA revealed the presence of additional genes 
flanking sacA gene, probably involved in safracin biosynthesis. 

The GenBank accession number for the nucleotide sequence data of 
the P. fluorescens A2-2 safracin biosynthetic cluster is AY061859. 

Cosmid library construction and heterologous expression 

To determine whether safracin cluster was able to confer safracin 
biosynthetic capability to a non producer strain, it was cloned into a wide 
range cosmid vector (pLAFR3, Staskawicz B., et al 1987) and conjugated 
to a different Pseudomonas sp collection strains. 

To obtain a clone containing the whole cluster, a cosmid library was 
constructed and screened. For this purpose, chromosomal DNA was 
partially digested with the restriction enzyme Psfl, the fragments were 
dephosphorylated and ligated into the Psfl site of cosmid vector pLAFR3. 
The cosmids were packaged with Gigapack III gold packaging extracts 
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(Stratagene) as manufacturer's recommendations. Infected cells of strain 
XLl-Blue were plated on LB-agar supplemented with 50 ng/ml of 
tetracycline. Positives clones were selected using colony hybridization with 
a DIG-labeled DNA fragment belonging to the 3'-end of the safracin cluster. 
In order to ensure the cloning of the whole cluster, a new colony 
hybridization with a 5'-end DNA fragment was done. Only cosmid pL30p 
showed multiple hybridizations with DNA probes. To confirm the accurate 
cloning, PCR amplification and DNA-sequencing with DNA oligonucleotides 
belonging to the safracin sequence were carried out. The size of the insert 
of pL30P was 26,705 bp . The pL30p clone DNA was transformed into E. 
coli S17XPIR and the resulting strain were conjugated with the 
heterologous Pseudomonas sp. strains. The pL30p cosmid was introduced 
into P. fluoresceins CECT378 and P. aeruginosa CECT110 by biparental 
conjugation as described above. Once a clone encoding the whole cluster 
was identified, it was determined whether the candidate was capable of 
producing safracin. Safracin production in the conjugated strains was 
assessed by HPLC analysis and biological assay of broth cultures 
supernatants as previously described. 

The strain P. fluorescens CECT378 expressing the pL30p cosmid 
(PM- 19-002) was able to produce safracin in considerable amounts, 
whereas safracin production in P. aeruginosa CECT110 strain expressing 
pL30P (PM-16-002) was 10 times less than the CECT378. Safracin 
production in these strains was about 22 % and 2 % of the total 
production in comparison with the natural producer strain. 

Genes involved in the formation of safracin. Sequence analysis of 
sacABCDEFGH and sacIJ qperons 

Computer analyses of the DNA sequence of pL30P revealed 14 ORFs 
(Fig. 1). A potential ribosome binding site precedes each of the ATG start 
codons. 
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In the sacABCDEFGH operon, three very large ORFs, sacA, sacB and 
sacC (positions 3052 to 6063, 6080 to 9268 and 9275 to 13570 of the P. 
ftuorescens A2-2 safracin sequence SEQ ID NO:l, respectively) can be read 
in the same direction and encode the putative safracin NRPSs: SacA (1004 
amino acids, M r 110452), SacB (1063 amino acids, M r 117539) and SacC 
(1432 amino acids, M r 157331). The three NRPSs genes contain the 
domains resembling amino acid activating domains of known peptide 
synthetases. Specifically, the amino acid activating domains from these 
NRPS genes are very similar to three of the four amino acid activating 
domains (Gly, Tyr and Tyr) found in the Myxococcus xanthus saframycin 
NRPSs (Pospiech et al., 1995, Pospiech et al., 1996). In particular, SacA 
(SEQ ID NO:2) shows 33% identity with saframycin Mxl synthetase B 
protein (SafB) from M. xanthus (NCBI accession number U24657), whereas 
SacB (SEQ ID NO:3) and SacC (SEQ ID NO:4) share, respectively, 39% and 
41% identity with saframycin Mxl synthetase A (SafA) from M. xanthus 
(NCBI accession number U24657). The Fig. 2 shows a comparison among 
SacA, SacB y SacC and the different amino acid activating domains of 
saframycin NRPS. 

Downstream sacC five small ORFs reading in the same direction as 
the NRPSs genes exist (Fig.l). The first one, sacD (position 13602 to 14651 
of R ftuorescens A2-2 safracin sequence), encodes a putative protein, SacD 
(350 amino acids, M r 39187; SEQ ID NO:5), with no similarities in the 
GeneBank DB. The next one, sacE (position 14719 tol4901 of P. 
ftuorescens A2-2 safracin sequence), encodes a small putative protein 
called SacE (61 amino acids, M r 6729; (SEQ ID NO:6)), which shows some 
similarity with proteins of unknown function in the databases (ORF1 from 
Streptomyces viridochromogenes (NCBI accession number Y17268; 44% 
identity) and MbtH from Mycobacterium tuberculosis (NCBI accession 
number Z95208; 36% identity). The third ORF, sacF (position 14962 to 
16026 of P. ftuorescens A2-2 safracin sequence), encodes a 355-residue 
protein with a molecular weigh calculated of 39,834 (SEQ ID NO:7). This 
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protein most closely resembles hydroxyneurosporene methyltransferase 
(CrtF) from Chloroflexus aurantiacus (NCBI accession number AF288602; 
25% identity). The nucleotide sequence of the fourth ORF, sacG (position 
16115 to 17155 of P. fluorescens A2-2 safracin sequence), predicted a gene 
product of 347 amino acids having a molecular mass of 38,22 kDa (SEQ 
ID NO:8). The protein, called SacG, is similar to bacterial O- 
methyltransferases, including O-dimethylpuromycin-O-methyltransferase 
(DmpM) from Streptomyces anvlatus (NCBI accession number P42712; 
31% identity). A computer search also shows that this protein contains the 
three sequence motifs found in diverse S-adenosylmethionine-dependent 
methytransferases (Kagan and Clarke, 1994). The fifth gene, sacH (position 
17244 to 17783 of P. fluorescens A2-2 safracin sequence), encodes a 
putative protein SacH (180 amino acids, M r 19632; (SEQ ID NO:9). A 
computer search for similarities, between the deduced amino acid 
sequence of SacH and other protein sequences, revealed identity with some 
conserved hypothetical proteins of unknown function, which contains a 
well conserved transmembrane motif and a dihydrofolate reductase-like 
active site (Conserved hypothetical protein from Pseudomonas aeruginosa 
PAOl, NCBI accession number P3469; 35% identity). 

Upstream sacABCDEFGH operon, reading in opposite sense, a two 
genes operon, sacU, is located. The sad gene (position 2513 to 1854) 
encodes a 220-amino acids protein (M r 24219; (SEQ ID NO: 10) that most 
closely resembles ubiquinone/manequinone methyltrasnferase from 
Thermotoga maritime (NCBI accession number AE001745; 32% identity). 
The sacJ gene (position 1861 to 335) encodes a 509-amino acid protein 
(SEQ ID NO: 11), with a molecular mass of 55341 Da, similar to bacterial 
monooxygenases/hydroxylases, including putative monooxygenase from 
Bacillus subtilis (NCBI accession number Y 14081; 33% identity) and 
Streptomyces coelicolor (NCBI accession number AL1 09972; 29% identity). 
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SacABCDEFGH and sacIJ operons are transcribed divergently and 
are separated by 450 bp approximately. Both operons are flanked by 
residual transposase fragments. 



Related safradn cluster genes 

A putative ORF (or/3; position 18322 to 19365 of P. fluorescens A2-2 
safracin sequence) located at the 3'-end of the safracin sequence has been 
found (Pig. 1). ORFl protein (SEQ ID NO:12) shows similarity with 
aminopeptidases from the Gene Bank DataBase (peptidase M20/M25/M40 
family from Caulobacter crescentus CB15; NCBI accession number 
NP422131; 30% identity). Using the strategy described in Example 2 the 
gene disruption of orfl do not affect safracin production in P. fluorescens 



A2-2. 



At the 3'-end of the safracin sequence cloned in pL30p cosmid, three 
putative ORFs (orf2 } or/3 and or/4), were found. Reading in opposite 
direction than sacABCDEFGH operon, or/2 gene (position 22885 to 21169 
of SEQ ID NO:l) codes for a protein, ORF2 (SEQ ID NO: 13), with 
similarities to Aquifex aeolicus HoxX sensor protein (NCBI accession 
number NC000918.1; 35% identity), whereas orf3 gene (position 23730 to 
23041 of SEQ ID NO:l) codes for ORFS protein (SEQ ID NO: 14) which 
shares 44% identity with a glycosil transferase related protein from 
Xanthomonas axonopodis pv. Citri str. 306 (NCBI accession number 
NP642442). 

The third gene is located at the 3'-end of SEQ ID NO:l (position 
25037 to 26095). This gene, named orf4 (position 2513 to 1854), encodes a 
protein, ORF4 (SEQ ID NO:15), that most closely resembles to a 
hypothetical isochorismatase family protein YcdL from Escherichia coll 
(NCBI accession number P75897; 32% identity). 
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Presumably, these three genes would not be involve in the safracin 
biosynthetic pathway, however, future gene disruption of these genes will 
confirm this assumption. 

The different DNA sequences found are listed at the end of the 
description. 



Example 4. Fun ctional analysis of the safracin loci and search for 
possible precursors 

Since the pathway for synthesis of safracin in P. fluorescens A2-2 is 
at present unknown, the inactivation of each of the genes described in 
Example 3 would permit fundamental studies on the mechanism of 
safracin biosynthesis in this strain. 

In order to analyze the functionality of each particular protein in the 
safracin production pathway, disruption of each particular gene of the 
cluster, but sacE, was performed. All of the genetic mutants were obtained 
following the disruption strategy previously described. 

Figure 6 is a summary of the different mutants constructed in this 
invention as well as a summary of the compounds produced by the 
mutants as a consequence of the gene disruption. In the wild type strain 
both safracin A and B and other compounds, P2 and PI 4, were clearly 
detected by HPLC (see Pig. 6,WT). The gene disruption of the sacA (PM-S1- 
002), sacB (PM-S1-003), sacC (PM-S1-004), sacD (PM-S1-010), sacF (PM- 
Sl-008), and sacG (PM-S1-009), genes generated mutants that were 
unable to produce neither safracin A and safracin B, nor the precursor 
compounds with retention times beneath 15 min, P2 and PI 4 respectively. 
The structure elucidation of P14 and P2 revealed that P14 is a 3-methyl-O 
methyl tyrosine, where as P2 is a 3-hydroxy-5-methyl- O-methyl tyrosine. 



e 
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Because of the small size of the sacE gene, the sacE> mutant was not 
possible to be obtained by gene disruption, but deletion of this gene is in 
process. The overexpression of SacE protein, in trans, had no effect on 
safracin B/A production. The sad mutants (PM-S1-006) produced P2, P14 
and significant amount of a compound called P19B (Fig. 6; Pig7b*). 
Structure elucidation of P19B revealed that this compound is a safracin- 
like molecule in which the JV-Met and one of the OH from the quinone ring 
are missing. In the sacJ mutants (PM-S1-005), P2, P14, P19B and two 
new compounds called P22A and P22B were obtained (Fig. 6; Fig. 7a*). 
Structure elucidation of P22A and P22B revealed that they are safracin A 
and safracin B like molecules, respectively, without one of the -OH group 
from the quinone ring. The biological assay of the sad and the sacJ 
mutants extracts revealed very low activity against Bacillus subtilis. 

The disruption of sad gene with the reconstitute of the sacJ gene 
expression resulted in a new safracins producer mutant, PM-S1-007. The 
two antibiotics produced, at levels of production as high as the levels of 
safracin A and safracin B in the wild type strain, have been named as 
safracin D and safracin E (Fig. 7c*). The safracin D and safracin E are 
safracin B and safracin A like molecules, respectively, where the N- 
methylation is missing. 

These results strongly suggest that i) sacA, sacB and sacC genes 
encode for the safracin NRPSs; ii) sacD, sacF and sacG genes are 
responsible for the transformation of L-Tyr into the L-Tyr derivative P2 and 
iii) sad and sacJ are responsible for the tailoring modifications that 
convert P19 and P22 into safracin. 



Characterization of Natural Prfirnrsnrc- 
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P-14 



OMe 




COOH 



Me 



NH 2 

CnH 15 N0 3 
Exact Mass: 209,11 
Mol. Wt: 209,24 
C, 63,14; H t 7,23; N, 6,69; O, 22,94 



Pseudomonas fluorescens A2-2 (wild type) (PM-S 1-001) 

Fermentation conditions: 

Seed medium YMP3 containing 1% glucose; 0.25% beef extract; 0.5% 
bacto-peptone; 0,25% NaCl; 0,8% CaC03 was inoculated with 0,1% of a 
fro2;en vegetative stock of the microorganism, and incubated on a rotary 
shaker (250 rpm) at 27°C. After 30h of incubation, the 2% (v/v) seed 
culture was transferred into 2000 ml Erlenmeyer flasks containing 250 ml 
of the M-16B production medium, composed of 15,2 % mannitol; 3,5 % 
Dried brewer's yeast; 1,4 % (NH 4 ) 2 S0 4 ; 0,001%; FeCl 3 ; 2,6 % C0 3 Ca. The 
temperature of the incubation was 27°C from the inoculation till 40 hours 
and then, 24°C to final process (71 hours). The pH was not controlled. The 
agitation of the rotatory shaker was 220 rpm with 5 cm eccentricity. 



After 71 hours of incubation, 2 Erlenmeyer flasks were pooled and 
the 500 ml of fermentation broth was clarified by 7.500 rpm centrifugation 
during 15 minutes. 50 grams of the resin XAD-16 (Amberlite) were added 
to the supernatant and mixed during 30 minutes at room temperature. 



Strain: 



Isolation: 
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Then, the resin was recovered from the clarified broth by filtration. The 
resin was washed twice with destilled water and extracted with 250 ml of 
isopropanol (2-PrOH). The alcohol extract was dried under high vacuum 
nil obtention of 500 mg crude extract. This crude was dissolved in 
methanol and purified by chromatographic column using Sephadex LH-20 
and methanol as mobile phase. The P-14 compound was eluted and dried 
as a 15 mg yellowish solid. The purity was tested by anafirycal HPLC and 
l H NMR. 

P-14 was also isolated in a similar way from cultures of the sacJ- mutant 
(PM-S1-005), using semipreparative HPLC as the last step in the 
purification process. 

Biological activities: 
NO ACTIVE 

Spectroscopic data: 

ESMS m/z 254 (CuHwNOaNaa*), 232 (CuHisNOsNa-), 210 (M+H + ). m RMN 
(300 MHz, CD3OD): 7.07 (d, J"=8.1 Hz, H-9), 7.06 (s, H-5), 6.84 (d, JN3.1 
Hz, H-8), 3.79 (s, H-ll), 3.72 (dd, J=8.7, 3.9 Hz, H-2), 3.20 (dd, ,>14.4, 
3.9 Hz, H-3a), 2.91 (dd, J=14.4, 8.9 Hz, H-3b), 2.16 (s, H-10). ™C RMN (75 
MHz, CD 3 OD): 174.1 (C-l), 158.6 (C-7), 132.5 (C-5), 128.9 (C-9), 128.5 (C- 
4), 128.0 (C-6), 111.4 (C-8), 57.6 (C-2), 55.8 (C-ll), 37.4 (C-3), 16.3 (C-10) 



P-2 
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OMe 




CnH 15 N0 4 
Exact Mass: 225,10 
Mol. WL: 225,24 
C, 58,66; H, 6,71; N, 6,22; O, 28,41 

Strain: 

Pseudomonas fluorescens A2-2 (wild type) (PM-S1-001) 

Fermentation conditions: 

The same process than P-14 

Isolation: 

Similar procedure as the P-14, except in the Sephadex 
chromatography, where the fractions containing P-2 have eluted later. A 
semi-preparative HPLC step (Symmetry Prep C-18 column, 7.8 x 150 mm, 
AcONH 4 10 mM pH=3/CH 3 CN 95:5 held for 5 min and then gradient from 
5 to 6.8 % of CH3CN in 3 min) has been necessary to purify the P-2. 
Also this compound has been isolated from the fermentation broth of the 
Pseudomonas putida ATCC12633+pB5H83 (PM- 17-004) as result of 
heterologous expression. 

Biological activities: 
NO ACTIVE 



Spetroscopic data: 



e 
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ESMS m/z226 [M+H] + ; *H RMN (CD3OD, 300 MHz): 6.65 (d, J= 1.8 Hz, 
H-5), 6.59 (d, J= 1.8 Hz, H-9), 3.72 (s, H-ll), 3.71 (dd, J= 9.0, 4.2 Hz, H- 
2), 3.16 (dd, J= 14.4, 4.2 Hz, H-3a), 2.83 (dd, J= 14.4, 9.0 Hz, H-3b), 2.22 
(s, H-10); i3 C RMN (DMSO, 75 MHz): 170.88 (s, C-l), 150.025 (s, C-7), 
144.56 (s, C-8), 132.28 (s, C-4), 130.36 (s, C-6), 121.73 (d, C-5), 115.55 (d,' 
C-9), 59.06 (q, 7-OMe), 55.40 (d, C-2), 36.21 (t, C-3), 15.86 (q, 6-Me). 



Characterization of Safracins like c ompounds obtained by knock out 
COMPOUND P-22B 



OMe 




C28H38N4O6 
Exact Mass: 526.28 
Mol. Wt.: 526,62 
C, 63,86; H, 7,27; N, 10,64; O, 18,23 



Strain: 



sac J- mutant from P.fluorescens A2-2 (PM-S 1-005) 



Fermentation conditions: 

50 litres of the SAM-7 medium (50 1) composed of dextrose (3,2%), 
mannitol (9,6%), dry brewer's yeast (2%), ammonium sulphate (1,4%), 
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potassium secondary phosphate (0,03%), potassium chloride (0,8%), Iron 
(III) chloride 6-hydrtate (0,001%), L-tyrosine (0,1%), calcium carbonate 
(0,8%), poly- (propylene glycol) 2000 (0,05%) and antifoam ASSAF 1000 
(0,2%) was poured into a jar-fermentor (Bioengineering LP-351) with 75 1 
total capacity and, after sterilization, sterile antibiotics (amplicillin 0,05 g/1 
and kanamycin 0,05 g/1) were added. Then, it was inoculated with seed 
culture (2%) of the mutant strain PM-S1-005. The fermentation was 
carried out during 71 h. under aerated and agitated conditions (1.0 
1/1/min and 500 rpm). The temperature was controlled from 27°C (from 
the inoculation till 24 hours) to 25°C (from 24h to final process). The pH 
was controlled at pH 6,0 by automatic feeding of diluted sulphuric acid 
from 22 hours to final process. 

Isolation 

The whole broth was clarified (Sharpies centrifuge). The pH of the 
clarified broth was adjusted to pH 9,0 by addition of NaOH 10% and 
extracted with 25 litres of ethyl acetate, after 20* mixing, the two phases 
were separated . The organic phase was frozen overnight and then, filtered 
for removing ice and evaporated to a greasy dark green extract (65,8 g). 
This extract was mixed with 500 ml hexane (250 ml two times) and filtered 
for removing hexane soluble impurities. The remaining solid, after drying, 
gave a 27,4 g of a dry green-beige extract. 

This new extract was dissolved in methanol and purified by a Sephadex 
LH-20 chromatography (using methanol as mobile solvent) and the 
safracins-like compounds were eluted in the central fractions (Analyzed on 
TLC conditions: Silica normal phase, mobile phase: BtOAc:MeOH 5:3. Aprox. 
Rf valor: 0,3forP-22B, 0,25 P-22A and 0,1 for P- 19). 

The pooled fractions, (7,6g) containing the three safracin-like compound 
were purified by a Silica column using a mixture of EtOAc:MeOH from 
50:1 to 0:1. and other chromatographic system (isocratic 
CHCl3:MeOH:H 2 0:AcOH 50:45:5:0.1). Compounds P22-A, P22-B and P19- 
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B were purified by reversed-phase HPLC (SymmetryPrep C-18 column 150 
x 7.8 mm, 4 mL/min, movile phase: 5 min MeOH:H 2 0 (0.02 % TFA) 5:95 
and gradient from MeOH:H 2 0 (0.02 % TFA) 5:95 to MeOH 100 % in 30 
min). 

Biological activities ofsqfracin P-22B 




Antimicrobial activy: On solid medium 

Bacillus subtitis. lO^g/disk (6mm diameter): 10 mm ihibition zone 
Spectroscopic data: 

HRFABMS m/z 509.275351 [M-H 2 0+H] + (calcd for CasHay^Os 509.276396 
□ 1.0 mmu); LRFABMS using m-NBA as matrix m/z (rel intensity) 509 [M- 
H a O+H] + (5), 460 (2.7), 391 (3). 

iH NMR (CD 3 OD, 500 MHz): 6.70 (s, H-15), 6.52 (s, H-5), 4.72 (bs, H-ll), 
4.66 (d, J= 2.0 Hz, H-21), 4.62 (dd, ,7= 8.4, 3.7 Hz, H-l), 3.98 (bd, J= 7.6 
Hz, H-13), 3.74 (s, 7-OMe), 3.71 (s, 17-OMe), 3.63 (m, overlapped signal, 
H-25), 3.62 (m, overlapped signal, H-3), 3.30 (m, H-22a), 3.29 (m, H-14a), 
3.18 (d, J= 18.6 Hz, H-14b), 2.90 (m, H-4a), 2.88 (m, H-22b), 2.76 (s, 12- 
NMe), 2.30 (s, 16-Me), 2.22 (m, H-4b), 1.16 (d, J= 7.4 Hz, H-26); 
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13C NMR (CD3OD, 125 MHz): 170.75 (s, C-24), 149.24 (s, C-18), 147.54 (s, 
C-8), 145.95 (s, C-7), 145.82 (s, C17), 133.93 (s, C-16), 132;31 (s, C-9), 
131.30 (s, C-6), 128.95 (s, C-20), 121.93 (d, C-15), 121.76 (d, C-5), 121.44 
(s, C-10), 112.45 (s, C-19), 92.87 (d, C-21), 60.86 (q, 7-OMe), 60.76 (q, 17- 
OMe), 59.39 (d, C-ll), 57.96 (d, C-13), 55.51 (d, C-l), 54.29 (d, C-3), 50.08 
(d, C-25), 45.55 (t, C-22), 40.43 (q, 12-NMe), 32.56 (t, C-4), 25.84 (t, C-14), 
17.20 (q, C-26), 16.00 (q, 16-Me), 15.81 (q, 6-Me). 



COMPOUND P-22A 



Strain: 

The same as for P-22B 

Fermentation conditions: 
The same as for P-22B 

Isolation: 

The same as for P-22B 



OMe 




Bological actiwties ofsqfracin P-22A 
Antitumor activities 
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Antimicr obial activy: On solid medium 

Bacillus subtilis. lOjig/disk (6mm diameter): NO ACTIVE 



Spectroscopic data: 

HRFABMS m/z 511.290345 [M+H] + (calcd for C28H39N4O5 511.292046 □ 
1.7 mmu); LRFABMS using m-NBA as matrix m/z (rel intensity) 511 
[M+H]+ (61), 409 (25), 391 (4); iH NMR (CD3OD, 500 MHz): 6.68 (s, H-15), 
6.44 (s, H-5), 3.71 (a, 7-OMe), 3.67 (s, 17-OMe), 2.72 (s, 12-NMe), 2.28 (s, 
16-Me), 2.20 (s, 6-Me), 0.87 (d, J= 7.1 Hz, H-26); 



COMPOUND P-19B 

OMe 




NH 2 
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Strain: 

The same as for P-22B 



Fermentation conditions: 
The same as for P-22B 



Isolation 

The same as for P-22B 



Biological activities of sqfracin P-19B 
Antitumor activities 
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Antimicrobial actiw: On solid medium 

Bacillus subtilis. lOjig/disk (6mm diameter): NO ACTIVE 



Spectroscopic data: 

HRFABMS m/z 495.260410 [M-H 2 0+H] + (calcd for C27H35N4O5 495.260746 
□ 0.3 mmu); LRFABMS using m-NBA as matrix m/z (rel intensity) 495 [M- 
H 2 0+H] + (13), 460 (3), 391 (2); *H NMR (CD3OD, 500 MHz): 6.67 (s, H-15), 
6.5 (s, H-5), 3.73 (s, 7-OMe), 3.71 (s, 17-OMe), 2.29 (s, 16-Me), 2.24 (s, 6- 
Me), 1.13 (d, J- 7.1 Hz, H-26); 
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New Safracin compounds obtained by knock out 
SAFRACIN D 

OMe 




C27H34N4O7 
Exact Mass: 526,24 
Mol. Wt.: 526,58 
C, 61,58; H. 6,51; N, 10,64; O, 21,27 



Strain: 

sac r with sacJ expression reconstitution from P.fluorescens A2-2 (PM-S1- 
007) 

Fermentation conditions: 

50 litres of the SAM-7 medium (50 1) composed of dextrose (3,2%), 
mannitol (9,6%), dry brewer's yeast (2%), ammonium sulphate (1,4%), 
potassium secondary phosphate (0,03%), potassium chloride (0,8%), Iron 
(III) chloride 6-hydrtate (0,001%), L-ryrosine (0,1%), calcium carbonate 
(0,8%), poly- (propylene glycol) 2000 (0,05%) and antifoam ASSAF 1000 
(0,2%) was poured into a jar-fermentor (Bioengineering LP-351) with 75 1 
total capacity and, after sterilization, sterile antibiotics (amplicillin 0,05 g/1 
and kanamycin 0,05 g/1) were added. Then, it was inoculated with seed 
culture (2%) of the mutant strain PM-S 1-007 The fermentation was carried 
out during 89 h. under aerated and agitated conditions (1.0 1/1/min and 
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500 rpm). The temperature was controlled from 27°C (from the inoculation 
till 24 hours) to 25°C (from 24h to final process). The pH was controlled at 
pH 6,0 by automatic feeding of diluted sulphuric acid from 27 hours to 
final process. 

Isolation: 

The cultured medium (45 1) thus obtained was, after removal of cells by 
centrifugation, adjusted to pH 9,5 with diluted sodium hydroxide, 
extracted with 25 liter of ethyl acetate twice. The mixture was carried out 
into an agitated-vessel at room temperature for 20 minutes. The two 
phases were separated by a liquid-liquid centrifuge. The organic phases 
were frozen at -20°C and filtered for removing ice and evaporated until 
obtention of a 35g. oil-dark-crude extract. After a 5 1. hexane triturating, 
the extract (12,6g) was purified by a flash-chromatographic column (5,5 
cm diameter, 20 cm length) on silica-normal phase, mobile phase: Ethyl 
acetate: MeOH: 1 L of each 1:0; 20:1; 10:1; 5:1 and 7:3. 250 ml- fractions 
were eluted and pooled depending of the TLC (Silica-Normal, EtOAc:MeOH 
5:2, Safracin D Rf 0,2, safracin E 0,05). The fraction containing impure 
safracin D and E was evaporated under high vacuum (2,2 g). An additional 
purification step was necessary to separate D and E on similar conditions 
(EtOAc:MeOH from 1:0 to 5:1), from this, the fractions containing safracin 
D and E are separate and evaporated and further purification by Sephadex 
LH-20 column chromatography eluted with methanol. 

The safracins D and E obtained were independent precipitated from 
CH2CI2 (80 ml) and Hexane (1500 ml) as a green/yellowish-dried solid (800 
mg safracin D) and (250 mg safracin E). 



Biological activities Safracin D 
Antitumour screening: 
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Antimicrobial activy: On solid medium 

Bacillus subtUis. 10ng/disk (6mm diameter): Inhibition zone: 15 mm 
diameter 



Spectroscopic data 

ESMS: m/z 509 [M-H 2 0+H]+; iH NMR (CDCI3, 300 MHz): 6.50 (s, C-15), 
4.02 (s, OMe), 3.73 (s, OMe), 2.22 (s, Me), 1.85 (s, Me), 0.80 (d, J= 7.2 Hz); 
«C NMR (CDCI3, 75 MHz): 186.51, 181.15, 175.83, 156.59, 145.09^ 
142.59, 140.78, 137.84, 131.20, 129.01, 126.88, 121.57 (2 x C), 82.59,' 
60.92, 60.69, 53.12, 21.40, 50.68, 50.22, 48.68, 40.57, 29.60, 25.01,' 
21.46, 15.64, 8.44. 



SAFRACIN E 

OMe 




NH 2 



C27H34N4O6 
Exact Mass: 510,25 
Mol. WL: 510,58 
C, 63,51; H, 6,71; N, 10,97; O, 18,80 

Strain: 

The same than safracin D 

Fermentation conditions: 
The same batch as safracin D 

Isolation: 

See safracin D conditions 

Biological actimties Safracin B 
Antitumour screening: 
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Antimicrobial actiw: On solid medium 

Bacillus subtilis. lO^g/disk (6mm diameter): 9,5 mm inhibition 



Spectroscopic data 

ESMS: m/z511 [M+H] + ; *H NMR (CDC1 3 , 300 MHz): 6.51 (s, C-15), 4.04 (s, 
OMe), 3.75 (s, OMe), 2.23 (s, Me), 1.89 (s, Me), 0.84 (d, J= 6.6 Hz); 
NMR (CDCI3, 75 MHz): 186.32, 181.28, 175.83, 156.43, 145.27, 142.75, 
141.05, 137.00, 132.63, 128.67, 126.64, 122.00, 120.69, 60.69, 60.21, 
59.12, 58.04, 57.89, 50.12, 49.20, 46.72, 39.88, 32.22, 25.33, 21.29, 
15.44, 8.23. 
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Example 5. Cross-feeding exp eriments 

Heterologous expression of safradn biosynthetic precursors genes for P2 
and PI 4 production 

In the attempt to shed light on the mechanism of the P2 and P14 
biosynthesis we have cloned and expressed the downstream NRPS genes to 
determine their biochemical activity. 

To overproduce PI 4, sacBFGH genes were cloned (pB7983) (Fig. 4). 
To overproduce P2 in a heterologous system, sacD to sacH genes were 
cloned (pB5H83)(Fig. 4). For this purpose we PCR amplified fragments 
harboring the genes of interest using oligonucleotides that contain a Xbal 
restriction site at the 5' end. Oligonucleotides PFSC79 (5'- 
CGTCTAGACACCGGCTTCATGG-3') and PFSC83 (5'- 

GGTCTAGATAACAGCCAACAAACATA-3') were used to amplify sacB to 
sacH genes; and oligonucleotides 5HPT1-XB (5'- 
CATCTAGACCGGACTGATATTCG-3') and PFSC83 (5'- 

GGTCTAGATAACAGCCAACAAACATA-3') were used to amplify sacD to 
sacH genes. The PCR fragments digested with Xbal were cloned into the 
Xbal restriction site of the pBBRl-MCS2 plasmid (Kovach et al, 1994). The 
two plasmids, pB7983 and pB5H83, were introduce separately into three 
heterologous bacteria P. fluorescens (CECT 378), P. putida (ATCC12633) 
and P. stutzeri (ATCC 17588) by conjugation (see table II). When culture 
broth of the fermentation of the transconjugant strains was checked by 
HPLC analysis, big amounts of P14 compound was visualized in the three 
strains containing pB7983 plasmid, whereas big amounts of P2 and some 
P14 product were observed when pB5H83 plasmid was expressed in the 
heterologa bacteria. 
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Cross-feeding 

As it was shown in Example 4, the sacF- (PM-S 1-008) and sacG 
(PM-S1-009) mutants were not able to produce neither safracins nor P2 
and P14 compounds. The addition of chemically synthesized P2 to these 
mutants during their fermentation yields safracin production. 

Moreover, the co-cultivation of an heterologous strain of P. stutzeri 
(ATCC 17588) harboring plasmid pB5H83 (PM- 18-004), which expression 
produces P2 and P14, with either one of the two mutants sacF- and sacG 
resulted in safracin production. The co-cultivation of an heterologous 
strain P. stutzeri (ATCC 17588) harboring plasmid pB7983 (PM- 18-005), 
which expression produces only P14, with either one of the two P. 
fluorescens A2-2 mutants mentioned before resulted in no safracin 
production at all. These results suggest that P14 is transformed into P2, a 
molecule that can easily be transported in and out through the 
Pseudorrwnas sp. cell wall and which presence it is absolutely necessary 
for the biosynthesis of safracin. 



Example 6. Biological productio n of new "unnatural" molecules 

The addition of 2g/L of an specific modified P2 derivative precursor, 
P3, a 3-hydroxy-5-methyl-0-methyltyrosine, to the sacF mutant (PM-S1- 
008) fermentation yielded two "unnatural" safracins that incorporated the 
modified precursor P3 in its structure, Safracin A(OEt) and Safracin 
B(OEt). 



SAFRACIN B-Etoxi (Safracin B (OEt)) 
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OEt 




NH 2 

C30H40N4O7 
Exact Mass: 568,29 
Mol. Wt: 568.66 
C, 63.36; H. 7.09; N. 9.85; 0. 19.69 

Strain 

saf F - mutant from P.fluorescens A2-2 (PM-S 1-008) 
Fermentation conditions: 

Seed medium containing 1% glucose; 0.25% beef extract; 0.5% bacto- 
peptone; 0,25% NaCl; 0,8% CaC03 was inoculated with 0,1% of a frozen 
vegetative stock of the microorganism, and incubated on a rotary shaker 
(250 rpm) at 27°C. After 30h of incubation, the 2% (v/v) seed culture of the 
mutant PM-S 1-008 was transferred into 2000 ml Erlenmeyer flasks 
containing 250 ml of the M- 16 B production medium, composed of 15,2 % 
mannitol; 3,5 % Dried brewer's yeast; 1,4 % (NH 4 )2 0,001%; FeCfe; 2,6 % 
COsCa and 0,2% P3 (3-hydroxy-5-methyl-0-methyltyrosine) The 
temperature of the incubation was 27°C from the inoculation till 40 hours 
and then, 24°C to final process (71 hours). The pH was not controlled. The 
agitation of the rotatory shaker was 220 rpm with 5 cm eccentricity. 

Isolation 

4 x 2000/250 ml Erlenmeyer flasks were joined together (970 ml), 
centrifuged (12.000 rpm, 4°C, 10', J2-21 Centrifuge BECKMAN) to remove 
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cells. The clarified broth (765 ml) was adjusted to pH 9,0 by NaOH 10%. 
Then, the alkali-clarified broth was extracted with 1:1 (v/v) EtOAc (x2). The 
organic phase was evaporated under high vacuum and a greasy-dark 
extract was obtained (302 mg). 

This extract was washed by an hexane trituration for removing impurities 
and the solids were purified by a chromatography column using Silica 
normal-phase and a mixture of Ethyl Acetate: Methanol (from 12:1 to 1:1). 
The fractions were analyzed under UV on TLC (Silica 60, mobile phase 
EtOAc:MeOH 5:4. Rf 0,3 (Safracin B-OEt and 0,15 Safracin A-OEt). From 
this, safracins B OEt (25 mg) and safracin A OEt (20 mg) were obtained. 



Biological activities of safracin B (OEt) 
Antitumor activities 




Antimicrobial actiw: On solid medium 

BacUhis subtilis. lO^g/disk (6 mm diameter): 17,5 mm inhibition zone 



Spectroscopic data: 

ESMS: m/z 551 [M-H 2 0+H] + ; *H NMR (CDCI3, 300 MHz): 6.48 (s, H-15), 
2.31 (s, 16-Me), 2.22 (s, 12-NMe), 1.88 (s, 6-Me), 1.43 (t, J= 6.9 Hz, Me- 
Etoxy), 1.35 (t, J= 6.9 Hz, Me-Etoxy), 0.81 (d, J= 7.2 Hz, H-26) 



52 



SAFRACIN A-Etoxi (Safracin A (OEt)) 



OEt 




NH 2 



C3oH4oN 4 0 6 

Exact Mass: 552,29 
Mol. Wt: 552,66 
C, 65,20; H, 7,30; N, 10,14; O. 17,37 



Strain: 

The same as for Safracin B (OEt) 

Fermentation conditions: 

The same as for Safracin B (OEt) 

Isolation: 

4 x 2000/250 ml Erlenmeyer flasks were joined together (970 ml), 
centrifuged (12.000 rpm, 4°C, 10', J2-21 Centrifuge BECKMAN) to remove 
cells. The clarified broth (765 ml) was adjusted to pH 9,0 by NaOH 10%. 
Then, the alkali-clarified broth was extracted with 1:1 (v/v) EtOAc (x2). The 
organic phase was evaporated under high vacuum and a greasy-dark 
extract was obtained (302 mg). 

This extract was washed by an hexane trituration for removing impurities 
and the solids were purified by a chromatography column using Silica 
normal-phase and a mixture of Ethyl Acetate: Methanol (from 12:1 to 1:1). 
The fractions were analyzed under UV on TLC (Silica 60, mobile phase 



53 



EtOAcMeOH 5:4. Rf 0,3 (Safracin B-OEt and 0,15 Safracin A-OEt). From 
this, safracins B OEt (25 mg) and safracin A OEt (20 mg) were obtained. 



Biological activities of safracin A (OEt): 
Antitumor activities 




Antimicro bial actiw: On solid medium 

Bacillus subtilis. lOjag/disk (6 mm diameter): 10 mm inhibition zone 
Spectroscopic data: 

ESMS: m/z 553 [M+H]- i H NMR (CDC1 3 , 300 MHz): 6.48 (s, H-15), 2.33 (s, 
16-Me), 2.21 (s, 12-NMe), 1.88 (s, 6-Me), 1.42 (t, J= 6.9 Hz, Me-Etoxy),' 
1.34 (t, </ = 6.9 Hz, Me-Etoxy), 0.8 (d, J= 6.9 Hz, H-26) 
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Example 7. Enzvmatic transformation of Safracin B into Safracin A 

In order to assay the enzymatic activity of conversion of safracin B 
into safracin A, a 120 hours fermentation cultures (see conditions in 
Example.2.BfoZogicaZ assay (biotest) for safracin production) of different 
strains were collected and centrifuged (9.000 rpm x 20 min.). The strains 
assayed were P. fluorescens A2-2, as wild type strain, and P.ftuorescens 
CECT378 + pBHPT3 (PM- 19-006), as heterologous expression host. 
Supernatant were discarded and cells were washed (CINa 0.9 %) twice and 
resuspended in 60 ml phosphate buffer 100 mM pH 7.2. 20 ml from the 
cell suspension was distributed into three Erlenmeyer flask: 

A. Cell suspension + Safracin B (400 mg/L) 

B. Cell suspension heated at 100 °C during 10 min. + Safracin B 
(400 mg/L) (negative control) 

C. Cell suspension without Safracin B (negative control) 

The biochemical reaction was incubated at 27 °C at 220 rpm and 
samples were taken every 10 min. Transformation of safracin B into 
safracin A was followed by HPLC. The results clearly demonstrated that 
the gene cloned in pBHPT3, sacH, codes for a protein responsible for the 
transformation of safracin B into safracin A. 

Based on this results we did an assay to find out if this same 
enzyme was able to recognise a different substrate such as ecteinascidin 
743 (ET-743) and transform this compound into Et-745 (with the C-21 
hydroxy missing). The experiment above was repeated to obtain 
Erlenmeyer flasks containing: 

A. Cell suspension + ET-743 (567 mg/L aprox.) 



55 



B. Cell suspension heated at 100 °C during 10 min. + ET-743(567 
mg/L) (negative control) 

C. Cell suspension without ET-743 (negative control) 

The biochemical reaction was incubated at 27 °C at 220 rpm and 
samples were taken at o, 10 min, lh, 2h, 3h, 4h, 20h, 40h, 44h, 48h. 
Transformation of ET-743 into ET-745 was followed by HPLC. The results 
clearly demonstrated that the gene cloned in pBHPT3, sacH, codes for a 
protein responsible for the transformation of Et-743 into Et-745. This 
demonstrates that this enzymes recognises ecteinascidin as substrate and 
that it can be used in the biotransformation of a broad range of structures. 
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SEQUENCES 

<110> PharmaMar 

<120> Hie gene cluster involved in safracin biosynthesis and its uses for 
genetic engineering 
<130> aaa 
<160> 15 

<170> Patentin version 3.1 

<210> SEQID 1 
<211> Lenght: 26705 
<212> Type DNA 

<213> Organism:Pseudomonas fluorescens A2-2 

<220> Feature: 

<221> NAME/KEY: sacB 

<222> LOCATION: (6080)..(9268) 

<223> OTHER INFORMATION: non ribosomal peptide synthetase gene 



<220> 

<221> sacC 

<222> (9275). .(13570) 

<223> non ribosomal peptide synthetase gene 



<220> 

<221> sacD 

<222> (13602).. (14651) 

<223> Hypothetical protein gene 
<220> 

<221> sacE 

<222> (14719).. (14901) 

<223> Hypothetical protein gene 

<220> 

<221> sacF 

<222> (14962)..(16026) 

<223> methyl-transferase protein gene 

<220> 

<221> sacG 

<222> (16115)..(17155) 

<223> methyl-transferase protein gene 



<220> 

<221> sacH 

<222> (17244). .(17783) 

<223> hypothetical protein gene 

<220> 

<221> sad 

<222> (1854).. (2513) 

<223> Complementary, methyl-transferase protein gene 

<220> 
<221> sacJ 
<222> (335). .(1861) 

<223> Complementary, mono-oxygenase protein gene 

<220> 
<221> orfl 

<222> (18322)..(19365) 

<223> aminoacid peptidase-like protein 



<220> 



<221> orf2 

<222> (2 1169).. (22885) 

<223> complementary; hox-like regulator protein 

<220> 

<221> orf3 

<222> (23041).. (23730) 

<223> complementary; glycosil transferase-like protein 



<220> 
<221> orf4 

<222> (25037)..(26095) 

<223> isochorismatase-like protein 



<400> SEQUENCE 1 

ctgcaggtgg tttgcgcgcg gaagacccgc cactgccggt gcgctcgttt gaattgcaca 60 
tggcgtggcg tgggtcgcag gataatgatc cgggggagcg gtggttgcgg tcgcggattc 120 
agatgttttt tggcgacccc gatagccttt aattaaactc cactaaaatc ggcgattgca 180 
gagcctgagt acaacacggc tactggactg aagtgggcgc atcgtgccgc atagccatag 240 
tgatctcggt gtgtctcgcc atgtcccggc ccaggtcgta ggtcatgctc ttgcgcattg 300 
ccagcatctt cgtgctccct tgccagctgt ttcaggtcag gctctgacgc gcggatttag 360 
aatcgtccag cagccactca cccaagcgct ccttggccaa ggtcgatttt ttaccgaccc 420 
agatcaccac gccatccggc ctgacgatga ctccctcgcc agcggacaag ctgttattgt 480 
gcgaatcctc gcagatagat gccgtttgca accccggaaa atcacgtcga agatcggcgg 540 
ccagtgcttt ggcacgatga tgtaccagga caaaccgccc tgctcgcagc aactgggtca 600 
agctctgtcg tggtaatcgc tcaccctcgg gaagcaggct caacaggggt aaacgtcggc 660 
ccaccaagcg atgatcgcct cggcgccgca cactgtcata gcgcacgccc tctcccgcca 720 
gtgcactgac caccttctgc gccacatacg gggcccgtgt cgcctgcaag ccgatccagt 780 
gaatcagacg gcctataggg ccggaagccg tattgaatcg gaaaagcaga tccgtgttgc 840 



gcaaggctgc cgccgcaata ggcctacgct cggcctcgta actctccaga agatccatcg 900 
gcaatgtggc ctgtatcaca cccgccagct tccaggcgag gttcgccgcg tcaccgatcc 960 
ccatctgcaa accttgcccc ccggcgggga cgtgggtgtg agcagcatct cccagcagaa 1020 
ataccctccc ctggcgataa tgagtcgcca ggcgctgctg gctgcggtaa cgagcgctcc 1080 
acagcacctg cgccaatccg aaatcggtte ccagaatatc tttcatccct ccggcaattt 1 140 
cctcgtgggt gaccggctgt ttgaccggag tatccatacg ttcgttgtct tcgatactga 1200 
cgcgataact gccatcgggt aatggaaaca gggcaaccag acccctggag accgaccttg 1260 
catggactgc aggtgaaggc ggatttctca agacaacgtc cgccaccacc aacgaatgct 1320 
tgtagtcctg cccgacaaac gaaatattga ggagttggcg gacagaactg ttgaccccat 1380 
cagcccccag cacccagtcg tagcggcttt gctgcacgct gccggtctcg ctgtgttcca 1440 
gggttacctc aacgtgtaaa tcgccggcat ccagagcctt tagcgcatac ccacgcttca 1500 
gattcacccc cttgcgattg acccaatcag tcaacaccga ctcggtctga gactgtggga 1560 
tgatcaccat gtggggatac tcacaaggga gtttggaaaa tgagagcgtt cggcccgcct 1620 
tgtcacccaa cggcgcgctt gcccagacga tcccccgacg tatcatctca tctgccacgc 1680 
cccaggcatt gagcaactcc agggtcacgg gctccaagcc aaaggcccgg gaatggggag 1740 
aggcagccgg tcttttatcg atgagatcaa ccgctatacc caactcggcc agagctgcag 1800 
ccacagccaa cccgacgggc cccgccccca cgaccaggac ctgtttattt ttaacgacca 1860 
tgatcaccca cctctccaca gcaggggcgg aacgtggcga caatgacgtg ctggtcggtg 1920 
acgctcgttt ccatgctctg cagcccggcc gtttgcgcga gctggattac ctcatgctct 1980 
gatcgataat ctgccagtga gcccagcagc cagttgatca atgtggtcaa tccccagccc 2040 
gcaggcaggt tctccaccag gcagaaaagg ccctgaggct tgagcacgcg acggacctca 2 100 
ttcaggctga cagacttgtc agcccaatgg ccgaacgaca tcgagcacac caccagatcc 2 160 
atgctttgcg aggggaatgg caaggcttcg gcaactcctt tgacgaacga ggcgaaggga 2220 



cgtcgtttgg cggcctcgtc gaccatgccc tgagccgggt cgacgccttc gaagcgcgcc 2280 
tcaggccaca gagcgaacat gcgttcgatc aatgcgccgg taccacaacc gatgtccaga 2340 
acacgctccg gtctcgaggt gcacatccat cggctcaaca ttcgcagaca gtcgtcatgg 2400 
gcctggctca gtttggtacc gtacttctct tcatacgtcg gcgcgatacg gctgaacgtc 2460 
cgcacaaaac ctggatttcc attgcctttc ccgccagaaa atacgttaga cattattgaa 2520 
catccatata tcaacagtta tccgccaagg accatagtag agaaaatcca tcccatccaa 2580 
ataaaaatta aataagtggg gctaaccgca atccagggaa actctgaaaa ggcccgctac 2640 
ttgtcgacgc ggctgtctgg aggccgcata gttactgaac ttactattaa aagactgggc 2700 
tttttcagag ccccaccgga tgttggctcc ttgtccatca tttcgggggc actgtaacat 2760 
tctgttacct ggctatcgct tgacttttaa tctgaacggg caattatagg tctaaccgca 2820 
acagccccac ggcgcttaag ttcgaaaaaa gtagctgcac cttgctcaac tgcatcttgt 2880 
aataaggggc actttacaag ccgcataaga cataaatttt atgctgactc cccaaacaag 2940 
cacgacaagt aaaaaacact tgtccaatac caaggagggt atacggtgca agcttcttta 3000 
cgtcaaaaag ttctctgctt acagcagtct atcgatccca gccagccagg catgttactt 3060 
gaagtcgctt ttcatgtgat cacccatctt tctacttcgc agttggtatc gcgtatcgag 3120 
agagtggtcg agcgacatgc gtctttacgg cagcgctttg tcatgcgcaa tggcacttac 3180 
tggattgaac aagccccacc gcaacaacga cgctactgcg tggtacgcac ctatgatgaa 3240 
gcatcgaccg atgcactgct ggcgccgagc cgcgagcaca tcggggttga gtctgagcgt 3300 
ttgttccgcg ccgaagtcgt tgagcgcagc gacggacaac gctacttggt cttccgaatt 3360 
catcacatca tcgccgacct gtggtctgtc ggcctcctga ttcgagactt tgccgaagac 3420 
tgtatggacc gctccagcat caccctggcg tcaagaccga ttgccccgtt gatcgaccct 3480 
gagttctggc ggcaccaaat gtcacaggac actccgtttt ccttgcccat ggcctccctg 3540 
gaacagcaca cggaccgccg catggtgctg tcttcgttcg ttattgatca ggagagcagc 3600 



gctgacctgg cccgcctggc cacagcctgc gcggtaaccc cgtacaccgt aatgctcgcc 3660 
gcacaagtat tggcgctgtc cagaatcggc cagagtggcc gtctgtcact tgcggtgacg 3720 
ttccatggcc gcaacagggg caacaaggat gcggtaggtt acttcgccaa tacgcttgcc 3780 
gtgcctttcg atgtcagcga atgcagcgtg ggcgagtttg tcaaacgcac cgccaagcgc 3840 
ctggatgagg cctcaaaagc cagcgtcggt gccggttatc ccgaattggc agagttcatg 3900 
acgccgctgg gatgggctgc gaccgccccg accaatgcgg tgatttacca gcaggatatg 3960 
ccaggcatgc caagaggatt ggcggcggct ctgctgggat tgggcacggt gcagttgggc 4020 
gagatggcgc tgaccgcgga acaggcaccg cccagcatcg gcccgtttgc cactgcgctg 4080 
ctgctgacgc gccacgacgg caagctgcat ggccgggtcg aggtcgatcc tgcgcagcat 4140 
cccggttggc tggcagaggc gttagccaga cagttcgctg tgatcctgcg ggaaatggtg 4200 
cgtgatccac aggccagact gtcagccttg ccagcgtgcc tgttacacca accaaaatac 4260 
ccgagccaag cgcggccggc gcctgcgtca gaaacattgg tagccacctt tctccggcaa 4320 
gtcgccatca cgccggacaa gcccgcgctg cgtacgccgc aggccagcat cagctatagc 4380 
gaattggcca gtcgagtcgc caggctctcg gcagccttgc gcgtacgcgg cttcaaacct 4440 
gaacagaccc tggcaatact cctgcctcgc gatatcaatc tggtacccgc tctgctggcg 4500 
atcatggcct gcggtggcag ttatgtgcca ctcagtgacg cgaaccccgc cgaactcaac 4560 
cgttcgattc tgaccagggc ccgttgccgc gcgattctca cggatcagga gggtttgacc 4620 
cgtttcgctc acttggcgcc ctgctggtcc ttgagcgacc tgctgtcgat gcccgacgcc 4680 
ccgctgcagg accagtccaa gcttcaagcc aaggcctata tcctatttac cagcggctcc 4740 
accggtgaac caaaaggcgt ggcgatcacc catgctaatg ccgccaacct gctgcgttgg 4800 
gcggctctcg attgtggccc cgagtacctg gcgcaaacac tggcggcaac ccccactacg 4860 
ttcgatcttt cgattttcga gatgtttgct ccccttatgg tcggtggctg cgtacagccc 4920 
gtttcctcgg tcatggcgct gatcgacaat ccggccctgc taaagggcac aacactgatc 4980 



aatacggtgc cgtcggtggc cgacgctttg ttgcagcatg atgtactggt gccttccttg 5040 
cgcatgctca acctcgcggg agaacccctg aaccgggatc tttacctgcg gcttcaggca 5 1 00 
aaactgaccg ccacacgcat cgtcaacctc tacggcccga cggaaacaac aacctattcc 5 1 60 
accgccctgg tgatcgagcc cgcacaacaa gagatcacca tcggttttcc actgtatggc 5220 
acctgggtgg atgtcgttga tcaaaacatg caaagcgtcg gtatcggtgt acctggcgag 5280 
ttgatcattc atggacacgg cgtggcgcaa ggctatgtca gcgaccccgt gcgtagcgcc 5340 
gcttctttcc tgccggcatc cgatggcttg cgttgctacc gcacgggaga ccgtgtccgc 5400 
tggttgcccg atggccgcct ggactttatc ggtcgagagg atgatcaggt caaagttcgc 5460 
ggtttccggg tcgagttggg gcctgttcag gcggcactgc atgccattga gacgattcat 5520 
gaatccgcag tagtcgttgt gccgaaaggg cagcagcgca gcatcgtggc gttcatcgtc 5580 
ctcaaagcgc cgagcgaaga tgaagcggtg cagcgcaata acatcaaaca acacttactc 5640 
ggcgtactcc cctattacgc actaccggac aagtttattt ttgttaaagc actgccaaga 5700 
aacacacatg gaaaaatcga cagaacgctg ctcttgcaac atgagcccca gactgagcaa 5760 
gaaagcgcca tgcgagatgc gaccgacgtc gaacatcgca tcgccaactg ctggcaaacc 5820 
atcatcggac accccgtcca actccacgaa aacttcctgg acattggcgg ccactcgctt 5880 
tcgcttacgc atttaacggg cctactgaga aaagaattta atattcatat ttctctacac 5940 
gacctctgga tcaggccaac catagaacaa caggccgact tcattcataa gttgcaaaat 6000 
tcggtattga caaaacctgc cgccgcgcca atcccgcgac ttgaccgaaa gatctctcat 6060 
cattaatcag gagtaccgca tgagcgtcga tacatgcagg actgcaactt tccctgcgtc 6120 
atacggccag gaacagatct ggtttctgaa cgaactaaac ccgcactctc aactggctta 6 180 
taccctggcg atgaaagtat ctatcgccgg gaaattgaac acactgcggt tgcagcgtgc 6240 
ggtcaaccaa gtggtggcct cccaggaaat tttgagaaca tcattcgcct ataaaaacca 6300 
gaagttgagc caggtcattt caccctccgc gacactgccc attcgcagcg cgcactgcat 6360 



tgacgatgta cctgggctgc aacgcctgat caacatggaa gcccagcgtg gctggtcgct 6420 
gagcagcgcg ccactgtacc gcttgctgct gataaaaacc ggcgaccagc aacatgagct 6480 
ggtcatctgc acccaccata tcgtctgcga tggcatctcg ctgcaactgc tgctgcaaaa 6540 
aatagtcagc gcctatcaag gccaaagcga tgggcgggtg ctcacaagtc cggatgaaga 6600 
gaccctgcaa ttcgtcgatt atgcggcctg gtcaaggcag cacgaatatg ccggtctcga 6660 
gtactggcgc cagcaactgg ccgacgcccc gacaatcctg gatatttcga caaaaaccgg 6720 
ccgaagtgag caacagacat ttctcggcgc gcgaattccc gtcgagttca gccaccacca 6780 
atggcaagca ttgcgccaga tattcagacc ccagggtatc tcctgcgcgg cggtgttcct 6840 
ggcggcctac tgcgtcgtcc tgcaccgcct ggccgagcag gacgacattc tgatcgggct 6900 
gccaacttca aatcgcctgc gtccggagtt ggcacaggtg atcggctacc tgtccaatct 6960 
gtgcgtgttt cgcagccagt atgctcacga ccagagcgtc acagactttc ttcaacaggt 7020 
tcaattgacc ttacccaact tgatcgagca cggggagacg cctttccagc aagtactgga 7080 
aagtgttgag catacccggc aagccggtgt gacgccgttg tgccaagtac tgtttggtta 7140 
tgagcaggac gttcgacgca cgctggatat cggcgacctg °caattgacgg tctcggatgt 7200 
ggacacgggg gccgcacgcc tggatctatc gctgttcttg ttcgaggacc acgaactcaa 7260 
cgtttgcggt tttctggaat atgccacgga ccgtatcgac gccgcatctg cgcaaaacat 7320 
ggtgcgcatg ctcagcagcg tgctacgcga gttcgttgcg gcgccgcagg cgccgctcag 7380 
cgaagtacag ctgggggcgg cggattccca agcccagaca cctgcgatcg caccagcatt 7440 
cccaagcgtg ccggctcgtc tgttcgcctt ggcagacagt caccccaatg cgaccgcgct 7500 
gcgtgacgag caaggtgaac tgacctatgc gcaagtttgc caacagattc tgcaggcagc 7560 
ggccactctg cgagcccagg gggcgaaacc tggaaccctg atcgcggtca tcggcgagcg 7620 
cggtaacccc tggttgatcg ccatgttggc gatctggcaa gtcggcggta tctatgtgcc 7680 
attgtccaag gacctgcccg aacagcgcct gcaaggcatc ctggcggaac tcgaaggggc 7740 



catactgatt accgacgaca ccacgccgga acgcttccgg caacgtgtga cgctgcccat 7800 
gcacgcctta tgggccgatg gcgcaacgca tcacgagcgg cagacgacgg acgccagccg 7860 
gctgtctggc tacatgatgt acacctcggg atcgaccggt aaaccgaaag gcgtgcatgt 7920 
cagccaggcc aacctggtcg cgaccctgag cgcattcggc cagctgctgc aggtgaaacc 7980 
cagcgatcgg atgctcgcac tgacgacctt ctccttcgac atttcgctgc tcgagctgct 8040 
gcttcccctg gtccagggcg ccagcgtgca aatcgctgtc gcacaggctc aacgcgatgc 81 00 
ggaaaagctc gcgggctatc tcgcagaccc tcggatcacg cttgttcagg ccacaccggt 8160 
gacctggaga ctattactgt cgacaggctg gcagccacgg gaaagcctga ccctgctgtg 8220 
cggtggcgaa gcgctgccac aggatctggc ggacaggttg tgcttgccgg gcatgacctt 8280 
gtggaacctc tacggcccca ccgaaacaac aatctggtcc acggcctgcc gcctgcaacc 8340 
gggtgcgccg gtgcaactgg gccatcccat tgcaggtacg caaatagccc tggtggatcg 8400 
gaacctgcgc agcgtgccca gaggggttat cggtgaactg ctgatttgcg gccccggcgt 8460 
cagccagggc tactatcgca acccggttga aacagccaag cggttcgtac cggacccgca 8520 
tggttcaggt aagcgcgcct atctgaccgg cgaccggatg cgcatgcagc aggatggttc 8580 
gctggcctat atcggccgac gtgacgacca gatcaagctg cgcggccacc gtatcgagct 8640 
gggagagatc gagacagcgt tgcgaaaact gcccggcgta cgggatgctg ccgcccaact 8700 
ccatgaccag gacccaagtc gaggcataca ggcctttgtc cagctttgcg caacggtcga 8760 
tgagagcctc atcgatatag gccagtggct ggaaacactg cgccaaacgc tgcctgaggc 8820 
gtggctgcct actgagtatt acaggatcga tggcatccct cttacctaca acggcaaacg 8880 
cgacaggaag cgcctcctgc accaggccgt caggctgcaa acactcagtc tgagggtggc 8940 
tcccagcagt gacaccgaga cccgggtgca gcagatctgg tgcgagctgc tcggtctcga 9000 
ggatatcggc gttacggatg attttttcca gttaggcggc cactccattc tggtggcgcg 9060 
catggtcgag cgcatcgaaa ccgcgtttgg acggcgcgta cctatcgcag atatctattt 9 120 



ttctccgacg atcgcccgtg tggcggcgac gctggactcc atgacatttg aacaaggact 9180 
ggccgcacac agcgtgaaag gcgattggga gttcaccgcc atcagccttc aacacaacgc 9240 
cgacagcaca gccgccgctc aggagagatg aatcatgcac agccccacta tcgatacttt 9300 
cgaggccgca ctgcgctcat tgcccgctgc ccgcgacgca cttggtgcct atcccttgtc 9360 
cagcgaacaa aagcgcctct ggttactggc ccaactggcg ggcacggcaa cgttgccggt 9420 
aacggtgcgt tatgcattca ccggcacggt ggaccttgct gtcgtgcagc agaacctgag 9480 
cgcgtggatc gcacacagcg agtccttacg cagccttttc gtcgaagtac tggaacgccc 9540 
cgtcaggctt ctgatgccta cgggcctggt gaaactggag tacttcgatc gcccgccatc 9600 
cgatgccgat atggccgagc tcataggcgc cgcctttgaa ctcgacaaag ggccgttgct 9660 
gcgtgcgttc atcactcgaa ccgctgcaca acagcatgaa ttgcatctgg tcggccatcc 9720 
tattgtcgtg gacgaacctt ccctgcagcg cattgcccaa accctcttcc agaccgaacc 9780 
cgatcatcag taccccgccg tcggtgcgat cgccgaggta ttccagcgcg aacagacact 9840 
ggcacaggat gcgcaaatca ccgaacaatg gcagcaatgg ggaataggcc ttcaggcgcc 9900 
tgcggcaacc gaaattccga ccgaaaaccc ccgccccgct atcaagggct cagatcgtca 9960 
agtacatgaa gcccttactg catggggcga ccaacccgta gcagaggccg aaattgtcag 10020 
cagttggctg accgtgctga tgcgctggca gggatcgcaa tcggcgcttt gcgcaatcaa 10080 
ggtgcgcgac aaggcgcatg ccaacttgat cggcccactg caaacctacc tgccggtccg 10140 
cgttgatatg ccggatggca gcaccctggc acaactgcga ctccaggtgg aggaacagct 10200 
caatggcaac gaccatccgt ccttttccac gctgctggaa gtttgcccac caaagcggga 10260 
cctgagtcgc accccctact tccaaaccgg cctgcagttc attgcgcacg atgttgaaca 10320 
gcgcgacttc catgccggca acttgacacg cctgccaacg aagcagccaa gcagcgacct 10380 
tgacctgttc atttcctgct gggtaagcga cggcacgctt ggcctgacgc tggattatga 10440 
ttgcgccgtg ctgaattcga gccaggtcga ggttctggcc caggcgctca tcagcgtatt 10500 



gtcagcgccc ggtgaacagc caatcgcaac cgttgcgctg atgggccagc aaatgcagca 10560 
1062? CCtS gCtcaggccc ac g§ ccccc g cacgacgccg ccgcaactga cactgaccga 

atgggtcgcc gccagcacgg aaaaatcccc gctggcggtt gcggtgatcg accacggcca 10680 
gcagctcagc tatgcagagt tatgggcaag agctgcactg gtagcggcga acatcagcca 10740 
gcatgtggca aagcctcgga gcatcatcgc tgtagcactg cccagatcgg ctgaatttat 10800 
tgcagcgctg ctgggggtag tgcgagcagg tcatgcgttc ttgcccatcg atccccgcct 10860 
gcccaccgac cgcatccagt tcctgattga aaacagtggc tgtgagttgg tcattacctc 10920 
tgatcagcaa tccgtggagg gttggccgca ggtcgccagg atacgaatgg aggcgcttga 10980 
tccagacatt cgctgggtgg cgccgacggg gctcagccac agcgatgccg cctacctgat 1 1040 
ctatacctcc ggcagcaccg gcgttccgaa gggagtcgtt gtcgagcacc ggcaagtagt 1 1 100 
gaataacatc ttgtggcggc aacgaacctg gccgctgacg gcacaggaca acgtgctgca 1 1 160 
taaccattcg ttcagcttcg atcccagcgt ctgggcgttg ttctggccgc tgctgaccgg 1 1220 
tggcaccata gtgctggcgg atgtcagaac catggaggac agcaccgccc tcctcgacct 1 1280 
gatgatccgc catgatgtca gcgttctggg tggcgtaccg agccttctcg gtacgctgat 1 1340 
cgatcatcca ttcgccaatg attgccgggc ggtcaagctg gtgctcagtg gcggcgaagt 1 1400 
cctcaacccc gaactggcac acaaaattca aaaggtctgg caggccgacg tcgccaacct 1 1460 
ctatggccct accgaagcga ccatcgatgc gctgtatttt tcgatcgaca aaaatgctgc 1 1520 
cggcgccatc ccgattggct atccaatcga caataccgac gcttatatcg tcgacctcaa 1 1580 
tctcaaccca gtcccgccag gcgttccggg agaaatcatg cttgctggcc agaaccttgc 1 1640 
gcgcggctat ttgggcaaac ctgcgcaaac cgcgcagcgc ttcctgccca acccatttgg 1 1700 
caacggacgc gtgtatgcaa cgggcgatct gggacgacgc tggtcatcgg gggccatcag 1 1760 
ctacctgggc cgacgcgacc aacaggtgaa gattcgcggg catcgcattg agcttaacga 1 1820 
agtcgctcat ctgttgtgcc aggcgcttga gctgaaggaa gccatcgtct tcgcccagca 1 1880 
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cgctggaacc gaacaggcac gcctggtggc ggccatcgag caacagccag gcctgcacag 
11940 

tgaaggtatc aaacaggaat tgctgcgcca cttgccagcc tatctgatcc ctagccagct 12000 
cctgctattg gatgaactgc caagaaccgc caccggcaag gtcgacatgc tcaagcttga 12060 
tcagttggca gcccctcagc tcaatgacgc cgggggcacg gaatgccgtg cgccacgtac 12120 
cgaccttgaa caatcggtca tgacggattt cgcccaagta ctcggcctca ctgcggtaac 12180 
gccggacacg gatttcttcg agcaaggcgg caactcgatt ctactcacgc gcctggcagg 12240 
caccttgtct gccaaatacc aggtgcagat tccactgcat gagtttttcc tgactccgac 12300 
cccggcagcg gtggcgcagg caattgaaat ctaccgtcgc gaaggcctca cggcactcct 12360 
gtcacgccag catgcacaaa cgctggagca ggacatctac ctggaagaac acattcggcc 12420 
ggatggctta ccacatgcca actggtacca gccttctgtc gtgtttctga ccggagccac 12480 
cggctacctg ggactgtacc tgatcgaaca gttgctcaag cgcaccacca gccgcgtcat 12540 
ctgcctgtgc cgtgcaaagg atgccgagca tgccaaggcc aggattctgg aaggcctgaa 12600 
aacctaccgc atcgacgtag gcagcgaact gyaccgggtg gagtacctca cgggcgacct 12660 
ggcgttgccg cacctgggcc tgagcgagca tcaatggcaa acgctggccg aagaggtcga 12720 
tgtgatttat cacaacggcg ccttggtcaa ctttgtctac ccctacagcg cactcaaggc 12780 
gaccaacgtg ggaggcacgc aggccattct ggaattggcc tgcaccgctc gactcaagag 12840 
tgttcagtat gtctccaccg tggatacgct cctggcgacg catgtccccc gcccttttat 12900 
cgaggacgat gcccccctgc gttccgccgt cggcgtacca gtgggctaca caggcagcaa 12960 
gtgggtggca gaaggggtgg ccaatcttgg cctgcgtcgc ggcattccgg tcagcatctt 13020 
ccgcccgggc ttgatcctgg gccataccga aacgggtgcc tcgcagagca tcgactacct 13080 
gctggtggcg ctacggggtt tcttgcccat gggcatcgtg ccggattacc cacgcatctt 13140 
cgacatcgtg cccgtggact aygtcgccgc ggcgatcgtg cacatatcga tgcaaccgca 13200 



gggcagggac aaattcttcc acctgttcaa cccggcgccg gtcaccatcc gccagttctg 13260 
cgactggatt cgcgaattcg gttacgagtt caagttggtc gacttcgaac acggtcggca 13320 
gcaggcattg agcgtaccgc ccgggcacct gctgtacccg ttggtccccc tgatcaggga 13380 
tgccgatccg ctgccccacc gcgcgctgga ccctgactac atccatgaag tgaaccccgc 13440 
actggaatgc aagcaaacct tagagctgct ggcctcctcg gacatcaccc tgtcgaaaac 13500 
cacaaaggct tacgcgcaca caattttgcg ctacctgata gacaccggct tcatggccaa 13560 
gcctggcgtg tagcggattg agcacaaaca ggacgaatat catggaatcg atagcctttc 13620 
ccattgcaca taagcccttc atcctgggct gtccggaaaa cctgccggcc accgagcggg 13680 
cgcttgcccc ttctgcggcg atggcgcggc aggttttgga gtacctcgaa gcgtgccccc 13740 
aggcgaaaaa cctcgagcag tacctcggga cgctgcgtga agtcctggcg cacctgcctt 13800 

f^oS? 0210 cg S act S at g accgatgatc cacgggaaaa ccaggaaaac cgcgacaacg 
138o0 

atttcgcctt cggtattgaa cgacaccagg gcgacactgt gaccctgatg gtcaaggcca 13920 
cccttgatgc agcgattcaa acgggcgagt tggtccaacg cagcggcact agcctggatc 13980 
actcggagtg gagcgacatg atgtcagtcg cacaggtgat tctgcagacg attgccgacc 14040 
ctcgggttat gcccgaatcc cgtttgacgt tccaggcacc gaaaagcaag gtcgaagaag 14100 
atgaccagga cccgctgcga cgctgggtgc gtggccacct gctgttcatg gtcctgtgcc 14160 
aaggcatgag cctgtgtacc aacctcctga tcagcgcggc ccacgacaag gacctcgaac 14220 
tggcgtgtgc acaggccaat cgcctgattc aactgatgaa catctcgcgc atcacgcttg 14280 
agtttgcaac cgacctgaac tcacaacagt acgtcagcca gattcgcccg acgctcatgc 14340 
cgscgatcgc gccgcccaag atgagtggca tcaactggcg tgaccatgtg gtgatgattc 14400 
gttggatgcg ccagtccacc gatgcctgga acttcattga gcaggcctac cctcaactgg 14460 
ctgaacgtat gcgaaccaca ttggcgcagg tctacagcgc tcatcggggg gtctgcgaaa 14520 
agttcgtagg cgaagaaaac accagtttgt tggccaagga aaacgccact aatacggccg 14580 
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gccaggtgtt ggaaaacctg aagaaatcga gattgaaata cctcaagaca aaaggttgcg 14640 
ccggtgcggg ataagccctg actgcgcctc gcccccatca aaaccggact gatattcggg 14700 
aaaacaaagg agagaagcat gccgacattt ctgggagacg acgacgcagt gccatgcgtg 

gtcgtcgtta acgccgacaa acactattcg atttggccaa gcgcgagaga cattccatca 14820 

ggttggtccg aagaaggatt caaggggtca cgttcagact gcttggaaca tatcgcgcaa 14880 

atctggccag agccgacggc atagatacaa cgtgatgcaa aaaatgcggg aaacatcaac 
1 4940 

taaccaaagc aaggaagaaa aatgacttca actcatcgca ccactgatca agtcaagcct 15000 
gctgttctgg atatgccagg cctgtcgggc attcttttcg gccacgccgc attccaatac 15060 
ctgcgggcca gctgcgaatt ggatctgttc gagcatgtcc gcgacctgcg cgaagccacc 15120 
aaggagagca tcagcagccg actgaagttg caggaacgcg ccgccgatat tctgctgctg 15180 
ggcgcgacct ccctgggcat gctggtcaag gaaaacggca tctaccgcaa tgccgatgtg 15240 
gttgaggatt tgatggccac ggacgactgg caacgtttca aggataccgt ggcctttgaa 15300 
aactatatcg tctatgaagg gcagctggac tttaccgagt ccctgcagaa aaacactaac 15360 
gtcggccttc agcgtttccc gggcgaaggg cgggacctct atcaccgcct gcaccagaat 15420 
cctaagctgg aaaacgtgtt ctaccgctac atgcgctcgt ggtctgaact ggccaaccag 15480 
gacctggtca agcacctcga cctgtcgcgc gtgaaaaaat tgctcgacgc gggtggcggt 15540 
gatgcggtca acgccatcgc cctggccaaa cacaatgagc aactgaacgt aacggtactg 15600 
gatatcgaca actccattcc ggtcactcag ggcaaaatca atgattccgg gctcagccac 15660 
cgggtgaaag cccaggcatt ggatatcctg caccaatcct tccctgaagg ttacgactgc 15720 
attctcttcg cccaccaatt ggtgatatgg accctcgaag aaaacaccca catgctgcgc 15780 
aaggcctacg atgcgctacc agaaggcgga cgcgtggtca tcttcaactc catgtccaac 15840 
gatgaaggcg acggcccggt catggccgca ctggacagcg tctactttgc ctgtctaccc 15900 
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gccgagggcg gcatgatcta ttcctggaaa cagtatgagg tctgcctggc ggaagccggc 15960 
ttcaaaaacc ccgtacgcac cgcgattcca ggctggaccc cacacggcat catcgtggcc 16020 
tacaagtaat tttgcctcct ccgcccctac tggggccgga ggagtcattt caacatttgc 16080 
gtcattgacg ccacctggcg atagggacac ccacatggca cgttcacccg agacaaatag 16140 
tgcgatgccg caacagataa gacagctttt atacagccaa ctgatttcgc aatcgattca 16200 
aaccttctgt gaactgcgcc tgcctgatgt tctgcaagca gctggccagc ctacctccat 16260 
cgaacggctt gctgagcaga cacacactca tatcagcgcc ctgtcacgct tgttgaaagc 16320 
gttgaaacca ttcgggctag tgaaagaaac cgacgaaggt ttttccttga ccgatctcgg 16380 
cgccagtctg acccacgacg cctttgcttc cgctcaaccc agtgctttgt tgatcaatgg 16440 
tgaaatgggc caagcctggc gtggcatggc gcagacaatc cgaaccggtg aatccagctt 16500 
caagatgtac tatggcatca gcctgttcga gtattttgaa cagcacccgg aacgccgggc 16560 
catttttgac cgttcccaag acatgggact ggacctggag atcccggaaa tcctggagaa 16620 
catcaacctg aatgacggtg agaacattgt cgatgtaggg ggtggttcag ggcatttgct 16680 
gatgcacatg ctggacaagt ggccagaaag cacaggcata cttttcgact tacccgtcgc 16740 
ggcaaaaatc gcgcagcaac atctgcacaa atctggaaaa gcaggctgct ttgaaatcgt 16800 
cgcaggggat tttttcaaga gcctgccgga cagtggcagc gtttacttgc tgtcccatgt 16860 
cttgcacgac tggggcgacg aagactgcaa ggccattttg gccacctgcc ggcggagcat 16920 
gccggacaat gcgctgttgg ttgtagtgga cttggtgatt gaccagagtg aaagtgccca 16980 
gcccaacccc acgggcgcaa tgatggatct ttacatgctg tccttgttcg gtatcgccgg 17040 
moT a§ag CgCaacgagg atgaattcag aaccctcatt gaaaacagcg gcttcaacgt 

caaacaggtg aagcgcctgc caagtggaaa cggcatcatc ttcgcctacc caaaataaat 17160 
gatcctcatt gcccctcgcc actttccagg ggggctattt tattctcggg tgattccccc 17220 



cctaatgatt acaaggaaga cacatgtcga cgctggttta ctacgtagca gcaaccctgg 17280 

atggttatat cgccactcaa caacacaaac tggattggct ggagaacttt gccctggggg 17340 

atgacgcaac ggcctatgay gatttttatc agacgatcgg agcagtggtc atgggatcgc 17400 

agacctatga atggatcatg tcgaacgctc ccgatgactg gccctaccag gacgtacccg 17460 

cctttgtcat gagcaaccgg gatctgtcag cccccgccaa tttggatatc accttcttac 17520 

gcggcgatgc cagtgccatc gcggtcaggg ccaggcaagc ggcgaagggc aagaatgtct 17580 

ggctggtcgg tggcggcaaa acggcggcct gttttgccaa cgcaggggaa ttacagcagc 17640 

tgttcatcac cactattcca acctttatcg gcaccggcgt tccggtactg cccgtagacc 17700 

gcgcgcttga agtggttctc agagaacaac gcacgctgca gagcggtgcc atggaatgca 17760 

tcctggacgt gaaaaaagcg gattaacgtc tacaagacaa tcgtgtatcg aaactcgcaa 17820 

cgtccaaacc caagggaaaa accagtgaag cgattggtat tgagtttatg tttgttggct 17880 

gttatcgctc tcgccagtgt tcaaggaata aggatggtga aacccgccgc cctgacagcc 17940 

gccgatgctc gcgatatcgg ctatctgaat gtacgcgata gcctttccgt cattgccgcc 18000 

gccccccacc ccaccgcctc acctcgccag gccgttgtca ggcattattt gcgggaaacg 18060 

attgcgggca tgggttacca ggtggttgag caaccctttc tatttaccat cgagagcatg 18120 

gtgaaccggc agaaaaccct ctatgccgag ttgaacgagc agcagcgcca agcgttcgat 18180 

gctgagctgg cccgggtggg cgcggacagt tttgaaaaag aagtgcggat tcgctccggc 18240 

ctactggaag gcgacagcgg ccagggaacc aacttgatag cctcacaccg cgtaccggga 
18300 *"* 

gcgaccgcga cggtcctgtt catggcgcat tacgacagcg tcggcaccgc tcccggtgcc 18360 
agtgacgatg gcatggccgt cgcctcgata ctccaactga tgcgggaaac cataacccgc 18420 
agcgatgcca aaaataacgt tgtctttcta ctcrccgatg gcgaagaact gggcttgctc 18480 
ggagcggagc actacgtctc gcagctcagt acgcctgaac gtgaagccat ccgcctggtg 18540 
ttgaactttg aagcccgggg taaccagggc atccctttac tgttsgagac atcccagaag 18600 



gactacgccc tgatcaggac tgttaacgca ggggttcggg acatcatatc cttctcattc 18660 

acgcccttga tttacaatat gctacaaaac gacaccgact ttacggtgtt caggaaaaag 18720 

aacatsgcgg ggttgaattt tgcagtcgtg gagggttttc agcactacca ccacatgags 18780 

gacaccgtgg agaaccttgs gccagagacc ttgtttcgct accaaaagac agtgcgtgaa 18840 

gtgggcaacc actttatcca gggtatcgac ctctcctccc tgagtgctga tgaggacgca 18900 

acctatttcc cactgccagg cggcacgctg ttggtactca acttacccac cctgtatgcg 18960 

ctgggcatgg gctcgttcgt gctctgcggt ctttgggcgc aacgctgccg cactcgccga 19020 

cagcatcagg gcaagaattg cgtactgcgc cccatggcta ttgccctgct cggcattgcc 19080 

tgcgcagcac ttgtattcta cgtcccgagc attgcctatc tattcgtcat ccccagtctg 19140 

cttctggctt gcgccatgtt gtcgcgaagc ctctttatct cctattcgat catgctgctg 19200 

ggcgcttatg cctgcgggat actctacgcg cctatcgtct acctgatttc atcaggcctt 19260 

aaaatgccgt tcattgccgg ggtcattgca ctactcccgc tctgcctgct ggccgtggga 19320 

ctggccggcg tcatcgcacg atcgagagac tgtcgaacct gcgactagca agacccgata 19380 

aaacgtcgct tcaaacgcca gatgacgtgc ctcgtcagcc aggcgtggaa ccatctggtg 19440 

gcggcaaatg tgcataaggt gggaacgcag agcgcccgct gcaacacgcc caccccaagc 
19500 

accgcgcctc aacggataat caggctcaag ggaattccac cttgcaacct gaaagagcaa 19560 
tcgagcgccc gtcggacaca acaaactgat caccgtcaat tcgggcaagg agcaatccac 19620 
gggcttttgc tccaacctca actccctttg aaaaatcagc cggccacaat ttgcccctac 19680 
cctttcagga tatcctcgat aagcgtttta tcagaacagc gaaaaaccac ttcaagttcg 19740 
tgtacttttt actgcgatct gcgatcgctc ccatggtaca araatgacag atgggaagat 19800 
cgctttaata cctactctct cacctgagaa aaagtaacca ccgggccgta ttcctgatca 19860 
gacactatcg cctgcacaca aaatttcttc tctggaaact tactcagcaa caccatcctc 19920 



catgactgag caataaggtt gcacagttgt tcataaacag cttcatcgac tctatcgctg 19980 

cagcctgcaa aaacatcata caggtgcaca tgattcaaaa ccttctcaac cgccaccaca 20040 

tcacgattac aagactgcat ccaactggag aatgcttcaa cagtaaatcc gctttccaga 20100 

aacacgcccg actcgtaaac aacaaagtct ggaaacaaca cggtggaaaa caccaggaca 
20160 

tcttcaggat gtacaacatc actaataaaa caagactcaa ccaactcgga ttgatccttc 20220 
cattgggact tccagcgctc gtatttggga gacctgacag ggtctgtcgt cctgattatt 20280 
ttcatagcac tgttcctgca ctgatgcctt ttggcgattt tttgtttgag cgagaatcga 20340 
tcgatggctc attgttcatc aagaaaaaaa ttctctcgaa acagactctt aagctcggcc 20400 
cggcgcgttg agaacagtga aaagtcactc gccacatcct gaatattctc tgtgaccctc 20460 
acaagatacc cagtattcag cttttcaatg ggaaagcctg acgcagcaat ccgctcaaca 20520 
tcgacctctt cagcaaactc atcgccgtag aacatagccc aaccgatatt gggcacgtcc 20580 
ggtttcaaag ccgaattgaa tgagccgatc tgaaaacttt tatatttctc atgcgggcta 20640 
agttcgggtt cactgaacag gtgcagcatt ccgagttgag gaggaaaaat ttcacaccat 20700 
gccttgaaga gcgaatacca atcgacgctc ttgctactga ccgagcggta tgtgattgcg 20760 
ccaggaacca cctgcccccg gacattgcgc gccgtgtgga tgacgctccc acagccttta 20820 
accgcctttt ttctgcgcca ggcaaagtcc aggtagaagt ccgataatgc gccgttaaac 20880 
cgtatggccg cctttgatgc ccagcacgct tcactcgcgg ccacgcccat gaaaggctct 20940 
ccgaatttat ccgcattgtg cgacacctgc tctggaacca gtaactcccc ttcccgactt 21000 
aatgacgaaa tgaacgcctc tccaacaccc caaccgatag tttcagactt cgtcctgatt 21060 
gtaatttgca catagggctt cataaatcgt caaagtctcg tcaattcacg ggtgacacaa 21 120 
gtatatccaa agagctctcc cacgttactc atcgcatcga gtctatcaac caaccaacgc 21 180 
cgcctcacgc gccaccgccc acggcgccat caaccgctgc ggggtctggc aggttttgcg 21240 
cttgagcacg aaattgcggc atttctcggc gaattggtgg cggttgtgca ccatgtccaa 21300 



ctgcatctgc gccagctcgg cttcacggca gcgtgtaatc tggtcgatgt ccaacgccgc 2 1360 
tttacgggcg cgagccacgg cgtacttttc atcggttaac gcgctgcttg cctgctgcat 21420 
cagccaacgg ctgaaggcgt gcgggcagcg cgggccgatg ccctgaacca gcccgtattg 21480 
ctcggcttgc aaggcactga tcggcaggca ggcgtcggtg agttgatggg cgacttcact 21540 
gccaacggcg cgtggcaggc tgtaggtcca gtattcggag ccgtacaggc ccatggtttt 21600 
gtaatgcggg ttgagtacca cgctctcgcg ggccaatacg atgtctgcgg ccagcgccag 21660 
cattacacca ccggcgccgg cgctgccggt caggccgctg atcaccagtt gccgggccgt 21720 
gagcagttcg tggcacacat cgtacgatgg cctgaatgtt ggcccaggct tccagccccg 21780 
gcactggggc ggcctggatg acgttgaggt gcacaccatt ggaaaagctg ccgcgcccgc 21840 
ccttgatcac cagcacttgg gtgtcccgcg tcttggccca gcgcaacgcc gccaccagtc 21900 
gctggcactg ctcggtgctc atggcgccgt tgtagaactc aaaggtgagt tcaccgacat 21960 
ggccggcttc gcgatagcga atcggttgat aggcttgctc atcgaacatt tgattggcga 22020 
tcgagctgtc cagcacggga atatccgcca gtgcttccgc cagcacgtgg cgggccggca 22080 
gcttgaaggt ctcctccccc ggccgggctt tgcgtttgag cgagccgatc cacaggctct 22140 
gatcaccggc cgccaccagc accgcgtcgt cctgcaccgc gaggatctca cccggtgtgc 22200 
cgtggcgcgc atccaggtgc gcgtcgtaca ggtaatactg cccgccctgg atactggcca 22260 
gcacaccggg ctggccatcg gctgcgtcga tgcagcgttt gatgaagcgt gcgcaatcgt 22320 
accaactgaa ggtgcgatca gcctgtgtca tgttcggctg caaacgcccg attacgtggg 22380 
cttgggtgta atcgagcggc accgggacga aaacccgggc gaacttttcc accacgtcgc 22440 
ggatgcaata gagggcggcg tcactcaccg cgccgttgta cagctcggat ttgcgcacat 22500 
cggcaggcat gtcgaattca caggtcgacc agatcggccc ggcgtccatt tcctccaccg 22560 
cctgcaaagc cgtgacgccc cagcggccga cctgctggct gatggcccag tccagcgcgc 22620 
tggcaccacg gtcgccgacg atgcccggat ggataatcac cacagggcgc tcaaggttgc 22680 



tccaaagttg ctgtggcaca cggtctttca gaaaggggca gatcaccagg tcggcgtctg 22740 
aatcctcgat ctgctggcac accaaggctg gatcggtgaa cagaacaacg ctgggcgcgt 22800 
gccccgactg gcgtaaatcc agccaggccc gctgggtcaa accgttgaac gccgacgcta 22860 
acacgatgat cttcaatgac cgcatggctg actcatcctt gagaatgcgc ggccagaggt 22920 
gctccttgag ccctccctgg cctttgatgg aagtacaagg atagttggcg tgccaggcag 22980 
gctacctgat caggatcaat cttgtgtcag cgagtgcttg aacgtaggcg cctgcgttca 23040 
accaataggc gcatggsctg gcgagtgctc ccgcgtgccc tctgccacaa gggacgccag 23100 
gtattcgcca aacccgccgc gacacttgta gtcccgacgc gcactggtga ccacgggatt 23160 
ggtcgccgtc cacacgatcc gcgcgccaat cctctcgagg tcggccacca actgcacatc 23220 
ttcatgggca accaaatgct ggaacccacc cgcgtttcga taggcatccg cactcaagcc 23280 
caggttggca ccgtgtatat ggcggtggtt ctcggtgaac tgatacaact caaggtagcg 23340 
cgaacgaacc gattcaccgt actcgctcca gctgtccacc tcgacggttc cgcacaccgc 23400 
atcggcgcca aagccgatct gacgcaccag ccagtcgrcg ggcacaactg tgtcagcgtc 23460 
ggtgaatgcc agccactggg cgccgacttc aagcaatcgc tccgcgccca aggccctggc 23520 
cttgcccaca tttcgaacgc tcacctcaag cgtkgcgaca cccatggccg acacgcgcgt 23580 
ggcggtctcg tccgaacacg catccagcac caccagcaat tggacctgtt ggtgtgccag 23640 
agccggatga gcaatggcgc gctggatgga ggcgaggcag gcactgatgt gccgttcttc 23700 
gttatgggca ggtatcacta tccctatcat tgacgttccc tctaccaggc aaagtgtcta 23760 
23820^ ° CgggCCgtg aggcagaagg tttaaacaat ctgaaggcgc cgccaaacaa 

tgacgtgaga caggtcgcag tgattaaacg gaacgtcaca ggcgccacag gctcagatgg 23880 
tttacgtgtt tgatgcacgg atgaacccgc cattcctaca aacaggtcag ccatcatgtc 23940 
taacgattat caaggtatcg ccagtgtcat cacggcttct cgtcacatgg gtacagactc 24000 



ggatgaacgc cttaatgaga cggtaaatat tcaattgacc tgcagcggta aaccaacgat 24060 

tgcgcggttg agtttcgaca ccccgcttca atggcccggc caccccaact ttgtgctgat 24120 

caacctgccg gacggttcat cggtgggtgg tgtgattgcc gaaattgaaa agtcgaccga 24180 

tggcccgggt tgggtgacgt ttacggtgga tgactgaggt cttcccaaca ggcttcaaat 24240 

cacctccagg cggctgcctc gaatgagaca cacaggccag taatcgagac gcacagacaa 
24300 " 

gcctattttc gcagatacat tttgtaacgt cctatgattg acgcttgctc gaatcaccgc 24360 
agggattggg tggcgtgtgt ttatcacgcc cttgaatccg cagcgaaaaa tgattcgagt 24420 
tcagcgaaca attcgattgg gacaaacaaa aggatgcggg ctatgtcatt gcgtaattta 24480 
tctttattgg tcaccacact ggcgctgttt aagtggggtg taatgcgctc gcggggcaaa 24540 
acccaacatg ctcagtgatg atgacgtgaa atcccaaagc gccggcgcac tgggctatgc 24600 
cccgacagac ctgagcatcg tcaaccgtcg aaccgaaggc accaacacct acgtgctgct 24660 
taaaaccaac gacaacaagc agttcaactg cattatcaac ggaggcaata tcctgacctt 24720 
cggtatgtcc aacccgcctt cgtgtgcgaa gaaaggtgaa cagatcaaga gtggcccgtt 24780 
cgggagctga tctgtcgctg gaaaaaaggg ccaggccacc tctaagaacg gaggcctggc 24840 
ccttttttat tcgctcagat gagtttaaaa gacaagatat cgggcagctg ggctccggcc 24900 
cgttcagtct gggcacccca cacaaaatgc tcagcgacta cttggccgtc gccgcacacc 24960 
gtttaacggg tgcgacctac agcgtcrccc tggttgaagg cagcaacgaa taaaccctat 25020 
tgatcggaga gcgaccatgc acccgcataa aaccgcgatt gtcttgattg aataccagaa 25080 
cgacttcacc acccccggcg gcgtgttcca tgacgctgtg aaagacgtca tgcaaacgtc 25140 
caacatgctg gcgaataccg ccaccacgat tgagcaggcc cgcaagctgg gcgtgaagat 25200 
catccactta cccatccgct ttgccgacgg ctacccagag ctgaccctgc gctcatacgg 25260 
cattctcaaa ggcgtcgccg acggcagcgc gtttcgtgcc ggcagctggg gcgccgagat 25320 



81 

cac^acgcg ctgaaacgcg accccaccga tattgtgatc gaaggcaaac gcggcctgga 

tgctttcgcc accaccgggc tggacctggt gctgcgcaac aatggcatcc agaacctggt 25440 
tgtcgcaggt ttcctgacta actgctgcgt tgaaggcacg gttcgatccg gttacgagaa 25500 
aggttatgac gtggtgacct tgaccgactg caccgcgaca ttcagtgatg aacaacagcg 25560 
cgcagccgag cagtttacgt tgccgatgtt tttcgcaaac cctgcaacac accgcgtttc 25620 
tgcaagcact gaacgccgga taaaaaaagc ggcggactcc tgccgagtcg ccgctttttt 25680 
agtgcttggg tcattcggtt ggcgcgtact gcatttcgcc gttcccaaac gaccagtctt 25740 
cgcgcttcac gtccaccagg ctgatccaca cgtcttcctt gcgcagcccg gtcttggcat 25800 
ggatgccgtc ggcgatgaac ttatagaaag cctttttcac gtcaatgctg cgcccggcgt 25860 
tccacgtgac ttggataaac acgatcttgg gtgtgtaagt gacgccaaga tacccggccg 25920 
ccgggtaaac cagctcatcc ttggcatggc ggttgatgat ctggaatttg tcgtgctcag 25980 
gcacgttggc ccacactggt catcgcggcg tacacgacat caccgatggc cgtcgcggtt 26040 
tcagtggaag tgtcggcggc gaggtcgatt cgaactaaag gcatggacaa atccttagtg 26100 
attttcagct gaaaatgggc gtgtggctca cacactcgcg ccaaccgggc aacttgcgcc 26160 
aggccaacga gttgctggcc cagggagttg ccgacggttt gcgctagtgc gccgcgaaac 26220 
ttcggcattt gacgcatcgg tgaatggctg accggatgtc agtgcttatt gacctgaata 26280 
tagactgccg tgcacagacc aatcaaacaa ataccggcga tgtagtaagc ggcgcccatc 26340 
tgactgtatt gaagcaatag agtaacgacc atcggcgtca ggccaccgaa tacggcgtac 26400 
26460** aggaaaatga caagccggaa aaccgcacta ccggtggaaa ggcacgcacc 

26520 SCag gggCtgCgCC tatc g c § cc g acaaaaaaac cggtaagtga atagagtgga 

accagccatt gcgggtgcgt ttcaagcgtc ttgaacaaga gcagtgcgct gaacagaagc 26580 
atgacgctgc cgatcatcaa tacccarccc gcactgaaat gatcggccag tttcccggcg 26640 



# 
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atcacgcaac caacactcaa rrcacrcaat agcgagactg ttggcctgca aggcttgcgc 26700 



<210> SEQID2 
<211> Lenght: 1004 
<212> Type: PRT 

<213> Organism: Pseudomonas fluorescens A2-2 
<400> SEQUENCE 2 

Met Leu Leu Glu Val Ala Phe His Val lie Thr His Leu Ser Thr Ser 
15 10 15 



Gin Leu Val Ser Arg lie Glu Arg Val Val Glu Arg His Ala Ser Leu 
20 25 30 



Arg Gin Arg Phe Val Met Arg Asn Gly Thr Tyr Trp lie Glu Gin Ala 
35 40 45 



Pro Pro Gin Gin Arg Arg Tyr Cys Val Val Arg Thr Tyr Asp Glu Ala 
50 55 60 



Ser Thr Asp Ala Leu Leu Ala Pro Ser Arg Glu His lie Gly Val Glu 
65 70 75 80 



Ser Glu Arg Leu Phe Arg Ala Glu Val Val Glu Arg Ser Asp Gly Gin 
85 90 95 



Arg Tyr Leu Val Phe Arg He His His He He Ala Asp Leu Tip Ser 
100 105 110 



Val Gly Leu Leu he Arg Asp Phe Ala Glu Asp Cys Met Asp Arg Ser 



tgcag 



26705 



115 



120 



125 



Ser He Thr Leu Ala Ser Arg Pro lie Ala Pro Leu He Asp Pro Glu 



130 



135 



140 



Phe Tip Arg His Gin Met Ser Gin Asp Thr Pro Phe Ser Leu Pro Met 
145 150 155 160 



Ala Ser Leu Glu Gin His Thr Asp Arg Arg Met Val Leu Ser Ser Phe 
165 170 175 



Val He Asp Gin Glu Ser Ser Ala Asp Leu Ala Arg Leu Ala Thr Ala 
180 185 190 



Cys Ala Val Thr Pro Tyr Thr Val Met Leu Ala Ala Gin Val Leu Ala 
195 200 205 



Leu Ser Arg He Gly Gin Ser Gly Arg Leu Ser Leu Ala Val Thr Phe 
210 215 220 



His Gly Arg Asn Arg Gly Asn Lys Asp Ala Val Gly Tyr Phe Ala Asn 
225 230 235 240 



Thr Leu Ala Val Pro Phe Asp Val Ser Glu Cys Ser Val Gly Glu Phe 
245 250 255 



Val Lys Arg Thr Ala Lys Arg Leu Asp Glu Ala Ser Lys Ala Ser Val 
260 265 270 



Gly Ala Gly Tyr Pro Glu Leu Ala Glu Phe Met Thr Pro Leu Gly Trp 
275 280 285 



Ala Ala Thr Ala Pro Thr Asn Ala Val He Tyr Gin Gin Asp Met Pro 
290 295 300 



Gly Met Pro Arg Gly Leu Ala Ala Ala Leu Leu Gly Leu Gly Thr Val 
305 310 315 320 



84 



Gin Leu Gly Glu Met Ala Leu Thr Ala Glu Gin Ala Pro Pro Ser lie 
325 330 335 



Gly Pro Phe Ala Thr Ala Leu Leu Leu Thr Arg His Asp Gly Lys Leu 
340 345 350 



His Gly Arg Val Glu Val Asp Pro Ala Gin His Pro Gly Trp Leu Ala 
355 360 365 



Glu Ala Leu Ala Arg Gin Phe Ala Val lie Leu Arg Glu Met Val Are 
370 375 380 8 



Asp Pro Gin Ala Arg Leu Ser Ala Leu Pro Ala Cys Leu Leu His Gin 
385 390 395 400 



Pro Lys Tyr Pro Ser Gin Ala Arg Pro Ala Pro Ala Ser Glu Thr Leu 
405 410 415 



Val Ala Thr Phe Leu Arg Gin Val Ala He Thr Pro Asp Lys Pro Ala 
420 425 430 



Leu Arg Thr Pro Gin Ala Ser lie Ser Tyr Ser Glu Leu Ala Ser Are 
435 440 445 



Val Ala Arg Leu Ser Ala Ala Leu Arg Val Arg Gly Phe Lys Pro Glu 
450 455 460 



Gin Thr Leu Ala He Leu Leu Pro Arg Asp lie Asn Leu Val Pro Ala 
465 470 475 480 



Leu Leu Ala lie Met Ala Cys Gly Gly Ser Tyr Val Pro Leu Ser Asp 
485 490 495 



Ala Asn Pro Ala Glu Leu Asn Arg Ser He Leu Thr Arg Ala Arg Cys 



85 



500 



505 



510 



Arg Ala lie Leu Thr Asp Gin Glu Gly Leu Thr Arg Phe Ala His Leu 
515 520 525 



Ala Pro Cys Trp Ser Leu Ser Asp Leu Leu Ser Met Pro Asp Ala Pro 
530 535 540 



Leu Gin Asp Gin Ser Lys Leu Gin Ala Lys Ala Tyr He Leu Phe Thr 
545 550 555 560 



Ser Gly Ser Thr Gly Glu Pro Lys Gly Val Ala He Thr His Ala Asn 
565 570 575 



Ala Ala Asn Leu Leu Arg Trp Ala Ala Leu Asp Cys Gly Pro Glu Tyr 
580 585 590 



Leu Ala Gin Thr Leu Ala Ala Thr Pro Thr Thr Phe Asp Leu Ser He 
595 600 605 



Phe Glu Met Phe Ala Pro Leu Met Val Gly Gly Cys Val Gin Pro Val 
610 615 620 



Ser Ser Val Met Ala Leu He Asp Asn Pro Ala Leu Leu Lys Gly Thr 
625 630 635 640 



Thr Leu He Asn Thr Val Pro Ser Val Ala Asp Ala Leu Leu Gin His 
645 650 655 



Asp Val Leu Val Pro Ser Leu Arg Met Leu Asn Leu Ala Gly Glu Pro 
660 665 670 



Leu Asn Arg Asp Leu Tyr Leu Arg Leu Gin Ala Lys Leu Thr Ala Thr 
675 680 685 



86 



Arg He Val Asn Leu Tyr Gly Pro Thr Glu Thr Thr Thr Tyr Ser Thr 
690 695 700 



Ala Leu Val lie Glu Pro Ala Gin Gin Glu lie Thr lie Gly Phe Pro 
705 710 715 720 



Leu Tyr Gly Thr Trp Val Asp Val Val Asp Gin Asn Met Gin Ser Val 
725 730 735 



Gly lie Gly Val Pro Gly Glu Leu He He His Gly His Gly Val Ala 
740 745 750 



Gin Gly Tyr Val Ser Asp Pro Val Arg Ser Ala Ala Ser Phe Leu Pro 
755 760 765 



Ala Ser Asp Gly Leu Arg Cys Tyr Arg Thr Gly Asp Arg Val Arg Trp 
770 775 780 



Leu Pro Asp Gly Arg Leu Asp Phe lie Gly Arg Glu Asp Asp Gin Val 
785 790 795 800 



Lys Val Arg Gly Phe Arg Val Glu Leu Gly Pro Val Gin Ala Ala Leu 
805 810 815 



His Ala He Glu Thr He His Glu Ser Ala Val Val Val Val Pro Lys 
820 825 830 



Gly Gin Gin Arg Ser lie Val Ala Phe He Val Leu Lys Ala Pro Ser 
835 840 845 



Glu Asp Glu Ala Val Gin Arg Asn Asn lie Lys Gin His Leu Leu Gly 
850 855 860 



Val Leu Pro Tyr Tyr Ala Leu Pro Asp Lys Phe He Phe Val Lys Ala 



865 870 875 880 



Leu Pro Arg Asn Thr His Gly Lys He Asp Arg Thr Leu Leu Leu Gin 
885 890 895 



His Glu Pro Gin Thr Glu Gin Glu Ser Ala Met Arg Asp Ala Thr Asp 
900 905 910 



Val Glu His Arg lie Ala Asn Cys Trp Gin Thr He lie Gly His Pro 
915 920 925 



Val Gin Leu His Glu Asn Phe Leu Asp lie Gly Gly His Ser Leu Ser 
930 935 940 



Leu Thr His Leu Thr Gly Leu Leu Arg Lys Glu Phe Asn lie His He 
945 950 955 960 



Ser Leu His Asp Leu Trp lie Arg Pro Thr He Glu Gin Gin Ala Asp 
965 970 975 



Phe He His Lys Leu Gin Asn Ser Val Leu Thr Lys Pro Ala Ala Ala 
980 985 990 



Pro He Pro Arg Leu Asp Arg Lys He Ser His His 
995 1000 



<210> SEQID3 
<211> Lenght: 1062 
<212> Type: PRT 

<213> Organism: Pseudomonas fluorescens A2-2 
<400> SEQUENCE 3 



Met Ser Val Asp Thr Cys Arg Thr Ala Thr Phe Pro Ala Ser Tyr Gly 
1 5 10 15 



Gin Glu Gin He Trp Phe Leu Asn Glu Leu Asn Pro His Ser Gin Leu 
20 25 30 



Ala Tyr Thr Leu Ala Met Lys Val Ser He Ala Gly Lys Leu Asn Thr 
35 40 45 



Leu Arg Leu Gin Arg Ala Val Asn Gin Val Val Ala Ser Gin Glu He 
50 55 60 



Leu Arg Thr Ser Phe Ala Tyr Lys Asn Gin Lys Leu Ser Gin Val He 
55 70 75 80 



88 



Ser Pro Ser Ala Thr Leu Pro lie Arg Ser Ala His Cys He Asp Asp 



90 



95 



Val Pro Gly Leu Gin Arg Leu lie Asn Met Glu Ala Gin Arg Gly Trp 



105 



110 



Ser Leu Ser Ser Ala Pro Leu Tyr Arg Leu Leu Leu He Lys Thr Gly 
115 120 io«; 



125 



^130" Ghl His ° 1 ?c Leu Val De ^ Thr His His ne Val °y s As P 



135 



140 



Gly He Ser Leu Gin Leu Leu Leu Gin Lys He Val Ser Ala Tyr Gin 



150 



155 



160 



Gly Gin Ser Asp Gly Arg Val Leu Thr Ser Pro Asp Glu Glu Thr Leu 
165 170 175 



Gin Phe Val Asp Tyr Ala Ala Trp Ser Arg Gin His Glu Tyr Ala Gly 



185 



190 



Leu Glu Tyr Trp Arg Gin Gin Leu Ala Asp Ala Pro Thr He Leu Asp 



200 



205 



e 



89 



lie Ser Thr Lys Thr Gly Arg Ser Glu Gin Gin Thr Phe Leu Gly Ala 
210 215 220 



Arg He Pro Val Glu Phe Ser His His Gin Tip Gin Ala Leu Arg Gin 
225 230 235 240 



He Phe Arg Pro Gin Gly lie Ser Cys Ala Ala Val Phe Leu Ala Ala 
245 250 255 



Tyr Cys Val Val Leu His Arg Leu Ala Glu Gin Asp Asp lie Leu He 
260 265 270 



Gly Leu Pro Thr Ser Asn Arg Leu Arg Pro Glu Leu Ala Gin Val He 
275 280 285 



Gly Tyr Leu Ser Asn Leu Cys Val Phe Arg Ser Gin Tyr Ala His Asp 
290 295 300 



Gin Ser Val Thr Asp Phe Leu Gin Gin Val Gin Leu Thr Leu Pro Asn 
305 310 315 320 



Leu He Glu His Gly Glu Thr Pro Phe Gin Gin Val Leu Glu Ser Val 
325 330 335 



Glu His Thr Arg Gin Ala Gly Val Thr Pro Leu Cys Gin Val Leu Phe 
340 345 350 



Gly Tyr Glu Gin Asp Val Arg Arg Thr Leu Asp He Gly Asp Leu Gin 
355 360 365 



Leu Thr Val Ser Asp Val Asp Thr Gly Ala Ala Arg Leu Asp Leu Ser 
370 375 380 



90 



Leu Phe Leu Phe Glu Asp Glu Leu Asn Val Cys Gly Phe Leu Glu Tyr 
385 390 395 400 



Ala Thr Asp Arg He Asp Ala Ala Ser Ala Gin Asn Met Val Arg Met 
405 410 415 



Leu Ser Ser Val Leu Arg Glu Phe Val Ala Ala Pro Gin Ala Pro Leu 
420 425 430 



Ser Glu Val Gin Leu Gly Ala Ala Asp Ser Gin Ala Gin Thr Pro Ala 
435 440 445 



lie Ala Pro Ala Phe Pro Ser Val Pro Ala Arg Leu Phe Ala Leu Ala 
450 455 460 



Asp Ser His Pro Asn Ala Thr Ala Leu Arg Asp Glu Gin Gly Glu Leu 
465 470 475 480 



Thr Tyr Ala Gin Val Cys Gin Gin lie Leu Gin Ala Ala Ala Thr Leu 
485 490 495 



Arg Ala Gin Gly Ala Lys Pro Gly Thr Leu He Ala Val lie Gly Glu 
500 505 510 



Arg Gly Asn Pro Tip Leu lie Ala Met Leu Ala lie Trp Gin Val Gly 
515 520 525 



Gly He Tyr Val Pro Leu Ser Lys Asp Leu Pro Glu Gin Arg Leu Gin 
530 535 540 



Gly He Leu Ala Glu Leu Glu Gly Ala He Leu He Thr Asp Asp Thr 
545 550 555 560 



Thr Pro Glu Arg Phe Arg Gin Arg Val Thr Leu Pro Met His Ala Leu 
565 570 575 



Trp Ala Asp Gly Ala Thr His His Glu Arg Gin Thr Thr Asp Ala Ser 
580 585 590 



Arg Leu Ser Gly Tyr Met Met Tyr Thr Ser Gly Ser Thr Gly Lys Pro 
595 600 605 



Lys Gly Val His Val Ser Gin Ala Asn Leu Val Ala Thr Leu Ser Ala 
610 615 620 



Phe Gly Gin Leu Leu Gin Val Lys Pro Ser Asp Arg Met Leu Ala Leu 
62 5 630 635 640 



Thr Thr Phe Ser Phe Asp He Ser Leu Leu Glu Leu Leu Leu Pro Leu 
645 650 655 



Val Gin Gly Ala Ser Val Gin lie Ala Val Ala Gin Ala Gin Arg Asp 
660 665 670 



Ala Glu Lys Leu Ala Gly Tyr Leu Ala Asp Pro Arg He Thr Leu Val 
675 680 685 



Gin Ala Thr Pro Val Thr Trp Arg Leu Leu Leu Ser Thr Gly Trp Gin 
690 695 700 



Pro Arg Glu Ser Leu Thr Leu Leu Cys Gly Gly Glu Ala Leu Pro Gin 
705 710 715 720 



Asp Leu Ala Asp Arg Leu Cys Leu Pro Gly Met Thr Leu Trp Asn Leu 
725 730 735 



Tyr Gly Pro Thr Glu Thr Thr lie Trp Ser Thr Ala Cys Are Leu Gin 
740 745 750 
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Pro Gly Ala Pro Val Gin Leu Gly His Pro lie Ala Gly Thr Gin lie 
755 760 765 



Ala Leu Val Asp Arg Asn Leu Arg Ser Val Pro Arg Gly Val lie Gly 
770 775 780 



Glu Leu Leu He Cys Gly Pro Gly Val Ser Gin Gly Tyr Tyr Are Asn 
785 790 795 800 



Pro Val Glu Thr Ala Lys Arg Phe Val Pro Asp Pro His Gly Ser Gly 
805 810 815 



Lys Arg Ala Tyr Leu Thr Gly Asp Arg Met Arg Met Gin Gin Asp Gly 
820 825 830 



Ser Leu Ala Tyr lie Gly Arg Arg Asp Asp Gin lie Lys Leu Arg Gly 
835 840 845 



His Arg He Glu Leu Gly Glu lie Glu Thr Ala Leu Arg Lys Leu Pro 
850 855 860 



Gly Val Arg Asp Ala Ala Ala Gin Leu His Asp Gin Asp Pro Ser Arg 
865 870 875 880 



Gly lie Gin Ala Phe Val Gin Leu Cys Ala Thr Val Asp Glu Ser Leu 
885 890 895 



He Asp lie Gly Gin Trp Leu Glu Thr Leu Arg Gin Thr Leu Pro Glu 
900 905 910 



Ala Trp Leu Pro Thr Glu Tyr Tyr Arg He Asp Gly He Pro Leu Thr 
915 920 925 



Tyr Asn Gly Lys Arg Asp Arg Lys Arg Leu Leu His Gin Ala Val 
930 935 940 



e 



Leu Gin Thr Leu Ser Leu Arg Val Ala Pro Ser Ser Asp Thr Glu Thr 
945 950 955 960 



Arg Val Gin Gin He Tip Cys Glu Leu Leu Gly Leu Glu Asp He Gly 
965 970 975 



Val Thr Asp Asp Phe Phe Gin Leu Gly Gly His Ser lie Leu Val Ala 
980 985 990 



Arg Met Val Glu Arg lie Glu Thr Ala Phe Gly Arg Arg Val Pro He 
995 1000 1005 



Ala Asp He Tyr Phe Ser Pro Thr He Ala Arg Val Ala Ala Thr 
1010 1015 1020 



Leu Asp Ser Met Thr Phe Glu Gin Gly Leu Ala Ala His Ser Val 
1025 1030 1035 



Lys Gly Asp Trp Glu Phe Thr Ala He Ser Leu Gin His Asn Ala 
1040 1045 1050 



Asp Ser Thr Ala Ala Ala Gin Glu Arg 
1055 1060 



<210> SEQ ID 4 
<211> Length: 1432 
<212> Type: PRT 

<213> Organism: Pseudomonas fluorescens A2-2 
<400> SEQUENCE 4 



93 



Met His Ser Pro Thr He Asp Thr Phe Glu Ala Ala Leu Arg Ser Leu 
15 10 15 
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Pro Ala Ala Arg Asp Ala Leu Gly Ala Tyr Pro Leu Ser Ser Glu Gin 
20 25 30 



Lys Arg Leu Tip Leu Leu Ala Gin Leu Ala Gly Thr Ala Thr Leu Pro 
35 40 45 



Val Thr Val Arg Tyr Ala Phe Thr Gly Thr Val Asp Leu Ala Val Val 
50 55 60 



Gin Gin Asn Leu Ser Ala Trp He Ala His Ser Glu Ser Leu Arg Ser 
65 70 75 80 



Leu Phe Val Glu Val Leu Glu Arg Pro Val Arg Leu Leu Met Pro Thr 
85 90 95 



Gly Leu Val Lys Leu Glu Tyr Phe Asp Arg Pro Pro Ser Asp Ala Asp 
100 105 110 



Met Ala Glu Leu lie Gly Ala Ala Phe Glu Leu Asp Lys Gly Pro Leu 
115 120 125 



Leu Arg Ala Phe lie Thr Arg Thr Ala Ala Gin Gin His Glu Leu His 
130 135 140 



Leu Val Gly His Pro lie Val Val Asp Glu Pro Ser Leu Gin Arg He 
145 150 155 160 



Ala Gin Thr Leu Phe Gin Thr Glu Pro Asp His Gin Tyr Pro Ala Val 
165 170 175 



Gly Ala lie Ala Glu Val Phe Gin Arg Glu Gin Thr Leu Ala Gin Asp 
180 185 190 



Ala Gin lie Thr Glu Gin Trp Gin Gin Trp Gly He Gly Leu Gin Ala 
195 200 205 



Pro Ala Ala Thr Glu lie Pro Thr Glu Asn Pro Arg Pro Ala He Lys 
210 215 220 



Gly Ser Asp Arg Gin Val His Glu Ala Leu Thr Ala Trp Gly Asp Gin 
225 230 235 240 



Pro Val Ala Glu Ala Glu He Val Ser Ser Trp Leu Thr Val Leu Met 
245 250 255 



Arg Trp Gin Gly Ser Gin Ser Ala Leu Cys Ala He Lys Val Arg Asp 
260 265 270 



Lys Ala His Ala Asn Leu lie Gly Pro Leu Gin Thr Tyr Leu Pro Val 
275 280 285 



Arg Val Asp Met Pro Asp Gly Ser Thr Leu Ala Gin Leu Arg Leu Gin 
290 295 300 



Val Glu Glu Gin Leu Asn Gly Asn Asp His Pro Ser Phe Ser Thr Leu 
305 310 315 320 



Leu Glu Val Cys Pro Pro Lys Arg Asp Leu Ser Arg Thr Pro Tyr Phe 
325 330 335 



Gin Thr Gly Leu Gin Phe He Ala His Asp Val Glu Gin Arg Asp Phe 
340 345 350 



His Ala Gly Asn Leu Thr Arg Leu Pro Thr Lys Gin Pro Ser Ser Asp 
355 360 365 



Leu Asp Leu Phe He Ser Cys Trp Val Ser Asp Gly Thr Leu Gly Leu 
370 375 380 
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Thr Leu Asp Tyr Asp Cys Ala Val Leu Asn Ser Ser Gin Val Glu Val 
385 390 395 400 



Leu Ala Gin Ala Leu He Ser Val Leu Ser Ala Pro Gly Glu Gin Pro 
405 410 415 



He Ala Thr Val Ala Leu Met Gly Gin Gin Met Gin Gin Thr Val Leu 
420 425 430 



Ala Gin Ala His Gly Pro Arg Thr Thr Pro Pro Gin Leu Thr Leu Thr 
435 440 445 



Glu Trp Val Ala Ala Ser Thr Glu Lys Ser Pro Leu Ala Val Ala Val 
450 455 460 



He Asp His Gly Gin Gin Leu Ser Tyr Ala Glu Leu Trp Ala Arg Ala 
465 470 475 480 



Ala Leu Val Ala Ala Asn lie Ser Gin His Val Ala Lys Pro Arg Ser 
485 490 495 



lie lie Ala Val Ala Leu Pro Arg Ser Ala Glu Phe lie Ala Ala Leu 
500 505 510 



Leu Gly Val Val Arg Ala Gly His Ala Phe Leu Pro He Asp Pro Arg 
515 520 525 



Leu Pro Thr Asp Arg lie Gin Phe Leu He Glu Asn Ser Gly Cys Glu 
530 535 540 



Leu Val He Thr Ser Asp Gin Gin Ser Val Glu Gly Trp Pro Gin Val 
545 550 555 560 



Ala Arg He Arg Met Glu Ala Leu Asp Pro Asp He Arg Trp Val Ala 
565 570 575 
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Pro Thr Gly Leu Ser His Ser Asp Ala Ala Tyr Leu lie Tyr Thr Ser 
580 585 590 



Gly Ser Thr Gly Val Pro Lys Gly Val Val Val Glu His Arg Gin Val 
595 600 605 



Val Asn Asn lie Leu Tip Arg Gin Arg Thr Tip Pro Leu Thr Ala Gin 
610 615 620 



Asp Asn Val Leu His Asn His Ser Phe Ser Phe Asp Pro Ser Val Trp 
625 630 635 640 



Ala Leu Phe Trp Pro Leu Leu Thr Gly Gly Thr He Val Leu Ala Asp 
645 650 655 



Val Arg Thr Met Glu Asp Ser Thr Ala Leu Leu Asp Leu Met lie Arg 
660 665 670 



His Asp Val Ser Val Leu Gly Gly Val Pro Ser Leu Leu Gly Thr Leu 
675 680 685 



He Asp His Pro Phe Ala Asn Asp Cys Arg Ala Val Lys Leu Val Leu 
690 695 700 



Ser Gly Gly Glu Val Leu Asn Pro Glu Leu Ala His Lys He Gin Lys 
705 710 715 720 



Val Trp Gin Ala Asp Val Ala Asn Leu Tyr Gly Pro Thr Glu Ala Thr 
725 730 735 



lie Asp Ala Leu Tyr Phe Ser He Asp Lys Asn Ala Ala Gly Ala lie 
740 745 750 
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Pro He Gly Tyr Pro lie Asp Asn Thr Asp Ala Tyr lie Val Asp Leu 
755 760 765 



Asn Leu Asn Pro Val Pro Pro Gly Val Pro Gly Glu He Met Leu Ala 
770 775 780 



Gly Gin Asn Leu Ala Arg Gly Tyr Leu Gly Lys Pro Ala Gin Thr Ala 
785 790 795 " 800 



Gin Arg Phe Leu Pro Asn Pro Phe Gly Asn Gly Arg Val Tyr Ala Thr 
805 810 815 



Gly Asp Leu Gly Arg Arg Trp Ser Ser Gly Ala lie Ser Tyr Leu Gly 
820 825 830 



Arg Arg Asp Gin Gin Val Lys He Arg Gly His Arg He Glu Leu Asn 
835 840 845 



Glu Val Ala His Leu Leu Cys Gin Ala Leu Glu Leu Lys Glu Ala He 
850 855 860 



Val Phe Ala Gin His Ala Gly Thr Glu Gin Ala Arg Leu Val Ala Ala 
865 870 875 880 



He Glu Gin Gin Pro Gly Leu His Ser Glu Gly lie Lys Gin Glu Leu 
885 890 895 



Leu Arg His Leu Pro Ala Tyr Leu He Pro Ser Gin Leu Leu Leu Leu 
900 905 910 



Asp Glu Leu Pro Arg Thr Ala Thr Gly Lys Val Asp Met Leu Lys Leu 
915 920 925 



Asp Gin Leu Ala Ala Pro Gin Leu Asn Asp Ala Gly Gly Thr Glu 
930 935 940 



c 



# 
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Arg Ala Pro Arg Thr Asp Leu Glu Gin Ser Val Met Thr Asp Phe Ala 
945 950 955 960 



Gin Val Leu Gly Leu Thr Ala Val Thr Pro Asp Thr Asp Phe Phe Glu 
965 970 975 



Gin Gly Gly Asn Ser He Leu Leu Thr Arg Leu Ala Gly Thr Leu Ser 
980 985 990 



Ala Lys Tyr Gin Val Gin He Pro Leu His Glu Phe Phe Leu Thr Pro 
995 1000 1005 



Thr Pro Ala Ala Val Ala Gin Ala He Glu lie Tyr Arg Arg Glu 
1010 1015 1020 ~ " " 

Gly Leu Thr Ala Leu Leu Ser Arg Gin His Ala Gin Thr Leu Glu 
1025 1030 1035 

Gin Asp He Tyr Leu Glu Glu His lie Arg Pro Asp Gly Leu Pro 
1040 1045 1050 

His Ala Asn Trp Tyr Gin Pro Ser Val Val Phe Leu Thr Gly Ala 
1055 1060 1065 



Thr Gly Tyr Leu Gly Leu Tyr Leu He Glu Gin Leu Leu Lys Arg 
1070 1075 1080 



Thr Thr Ser Arg Val He Cys Leu Cys Arg Ala Lys Asp Ala Glu 
1085 1090 1095 



His Ala Lys Ala Arg He Leu Glu Gly Leu Lys Thr Tyr Arg lie 
1100 1105 1110 



100 



Asp Val Gly Ser Glu Leu His Arg Val Glu Tyr Leu Thr Gly Asp 
1115 1120 H25 

Leu Ala Leu Pro His Leu Gly Leu Ser Glu His Gin Trp Gin Thr 
1130 H35 H40 



Leu Ala Glu Glu Val Asp Val lie Tyr His Asn Gly Ala Leu Val 
1145 H50 1155 



Asn Phe Val Tyr Pro Tyr Ser Ala Leu Lys Ala Thr Asn Val Gly 
1160 H65 1170 

Gly Thr Gin Ala He Leu Glu Leu Ala Cys Thr Ala Arg Leu Lys 
1175 U80 H85 



Ser Val Gin Tyr Val Ser Thr Val Asp Thr Leu Leu Ala Thr His 
1190 1195 1200 



Val Pro Arg Pro Phe lie Glu Asp Asp Ala Pro Leu Arg Ser Ala 
1205 1210 1215 



Val Gly Val Pro Val Gly Tyr Thr Gly Ser Lys Trp ValAlaGlu 
1220 1225 1230 



Gly Val Ala Asn Leu Gly Leu Arg Arg Gly He Pro Val Ser He 
1235 1240 1245 



Phe Arg Pro Gly Leu lie Leu Gly His Thr Glu Thr Gly Ala Ser 
1250 1255 1260 



Gin Ser He Asp Tyr Leu Leu Val Ala Leu Arg Gly Phe Leu Pro 
1265 1270 1275 



Met Gly lie Val Pro Asp Tyr Pro Arg He Phe Asp lie Val Pro 
1280 1285 1290 



Val Asp Tyr Val Ala Ala Ala lie Val His lie Ser Met Gin Pro 
1295 1300 1305 



GlnGly Arg Asp Lys Phe Phe His Leu Phe Asn Pro Ala Pro Val 
1310 1315 1320 



Thrlle Arg Gin Phe Cys Asp Trp lie Arg Glu Phe Gly Tyr Glu 
1325 1330 1335 



Phe Lys Leu Val Asp Phe Glu His Gly Arg Gin Gin Ala Leu Ser 
1340 1345 1350 



Val Pro Pro Gly His Leu Leu Tyr Pro Leu Val Pro Leu lie Arg 
1355 1360 1365 



Asp Ala Asp Pro Leu Pro His Arg Ala Leu Asp Pro Asp Tyr He 
1370 1375 1380 



His Glu Val Asn Pro Ala Leu Glu Cys Lys Gin Thr Leu Glu Leu 
1385 1390 1395 



Leu Ala Ser Ser Asp He Thr Leu Ser Lys Thr Thr Lys Ala Tyr 
1400 1405 1410 



Ala His Thr lie Leu Arg Tyr Leu lie Asp Thr Gly Phe Met Ala 
1415 1420 1425 



Lys Pro Gly Val 
1430 



<210> SEQ ID 5 
<211> Lenght: 350 
<212> Type: PRT 



<213> Organism: Pseudomonas fluorescens A2-2 
<400> SEQUENCE 5 

Met Glu Ser He Ala Phe Pro lie Ala His Lys Pro Phe lie Leu Gly 
1 5 10 15 



Cys Pro Glu Asn Leu Pro Ala Thr Glu Arg Ala Leu Ala Pro Ser Ala 
20 25 30 



Ala Met Ala Arg Gin Val Leu Glu Tyr Leu Glu Ala Cys Pro Gin Ala 
35 40 45 



Lys Asn Leu Glu Gin Tyr Leu Gly Thr Leu Arg Glu Val Leu Ala His 
50 55 60 



Leu Pro Cys Ala Ser Thr Gly Leu Met Thr Asp Asp Pro Arg Glu Asn 
65 70 75 80 



Gin Glu Asn Arg Asp Asn Asp Phe Ala Phe Gly He Glu Arg His Gin 
85 90 95 



Gly Asp Thr Val Thr Leu Met Val Lys Ala Thr Leu Asp Ala Ala He 
100 105 HO 



Gin Thr Gly Glu Leu Val Gin Arg Ser Gly Thr Ser Leu Asp His Ser 
115 120 125 



Glu Trp Ser Asp Met Met Ser Val Ala Gin Val He Leu Gin Thr He 
130 135 140 



Ala Asp Pro Arg Val Met Pro Glu Ser Arg Leu Thr Phe Gin Ala Pro 
245 150 155 160 



102 



Lys Ser Lys Val Glu Glu Asp Asp Gin Asp Pro Leu Arg Arg Trp Val 
165 170 175 
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Arg Gly His Leu Leu Phe Met Val Leu Cys Gin Gly Met Ser Leu Cys 
180 185 190 



Thr Asn Leu Leu He Ser Ala Ala His Asp Lys Asp Leu Glu Leu Ala 
195 200 205 



Cys Ala Gin Ala Asn Arg Leu He Gin Leu Met Asn lie Ser Arg lie 
210 215 220 



Thr Leu Glu Phe Ala Thr Asp Leu Asn Ser Gin Gin Tyr Val Ser Gin 
225 230 235 240 



He Arg Pro Thr Leu Met Pro Ala lie Ala Pro Pro Lys Met Ser Gly 
245 250 255 



lie Asn Trp Arg Asp His Val Val Met He Arg Trp Met Arg Gin Ser 
260 265 270 



Thr Asp Ala Trp Asn Phe He Glu Gin Ala Tyr Pro Gin Leu Ala Glu 
275 280 285 



Arg Met Arg Thr Thr Leu Ala Gin Val Tyr Ser Ala His Arg Gly Val 
290 295 300 



Cys Glu Lys Phe Val Gly Glu Glu Asn Thr Ser Leu Leu Ala Lys Glu 
305 310 315 320 



Asn Ala Thr Asn Thr Ala Gly Gin Val Leu Glu Asn Leu Lys Lys Ser 
325 330 335 



Arg Leu Lys Tyr Leu Lys Thr Lys Gly Cys Ala Gly Ala Gly 
340 345 350 
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<210> SEQID6 
<211> Lenght:6l 
<212> Type: PRT 

<213> Organism: Pseudomonas fluorescens A2-2 
<400> SEQUENCE: 6 

Met Pro Thr Phe Leu Gly Asp Asp Asp Ala Val Pro Cys Val Val Val 
15 10 15 

Val Asn Ala Asp Lys His Tyr Ser He Trp Pro Ser Ala Arg Asp He 
20 25 30 

Pro Ser Gly Trp Ser Glu Glu Gly Phe Lys Gly Ser Arg Ser Asp Cys 

Leu Glu His lie Ala Gin lie Trp Pro Glu Pro Thr Ala 
50 55 60 



<210> SEQ ID 7 
<211> Lenght: 355 
<212> Type: PRT 

< ?1 3> ^S^s 111 : Pseudomonas fluorescens A2-2 
<400> SEQUENCE 7 

Met Thr Ser Thr His Arg Thr Thr Asp Gin Val Lys Pro Ala Val Leu 
x 5 10 15 



Asp Met Pro Gly Leu Ser Gly He Leu Phe Gly His Ala Ala Phe Gin 
^° 25 30 

Tyr Leu Arg Ala Ser Cys Glu Leu Asp Leu Phe Glu His Val Arg Asp 

Leu Arg Glu Ala Thr Lys Glu Ser lie Ser Ser Arg Leu Lys Leu Gin 
ou 55 60 
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Glu Arg Ala Ala Asp He Leu Leu Leu Gly Ala Thr Ser Leu Gly Met 
65 70 75 80 



Leu Val Lys Glu Asn Gly He Tyr Arg Asn Ala Asp Val Val Glu Asp 
85 90 95 



Leu Met Ala Thr Asp Asp Tip Gin Arg Phe Lys Asp Thr Val Ala Phe 
100 105 HO 



Glu Asn Tyr lie Val Tyr Glu Gly Gin Leu Asp Phe Thr Glu Ser Leu 
115 120 125 



Gin Lys Asn Thr Asn Val Gly Leu Gin Arg Phe Pro Gly Glu Gly Arg 
130 135 140 



Asp Leu Tyr His Arg Leu His Gin Asn Pro Lys Leu Glu Asn Val Phe 
145 150 155 160 



Tyr Arg Tyr Met Arg Ser Trp Ser Glu Leu Ala Asn Gin Asp Leu Val 
165 170 175 



Lys His Leu Asp Leu Ser Arg Val Lys Lys Leu Leu Asp Ala Gly Gly 
180 185 190 



Gly Asp Ala Val Asn Ala lie Ala Leu Ala Lys His Asn Glu Gin Leu 
195 200 205 



Asn Val Thr Val Leu Asp He Asp Asn Ser He Pro Val Thr Gin Glv 
210 215 220 



Lys He Asn Asp Ser Gly Leu Ser His Arg Val Lys Ala Gin Ala Leu 
225 230 235 ~ 240 



Asp lie Leu His Gin Ser Phe Pro Glu Gly Tyr Asp Cys lie Leu Phe 
245 250 255 
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Ala His Gin Leu Val He Tip Thr Leu Glu Glu Asn Thr His Met Leu 
260 265 270 



Arg Lys Ala Tyr Asp Ala Leu Pro Glu Gly Gly Arg Val Val lie Phe 
275 280 285 



Asn Ser Met Ser Asn Asp Glu Gly Asp Gly Pro Val Met Ala Ala Leu 
290 295 300 



Asp Ser Val Tyr Phe Ala Cys Leu Pro Ala Glu Gly Gly Met He Tvr 
305 310 315 320 

Ser Trp Lys Gin Tyr Glu Val Cys Leu Ala Glu Ala Gly Phe Lys Asn 
325 330 335 

Pro Val Arg Thr Ala lie Pro Gly Trp Thr Pro His Gly lie lie Val 
340 345 350 



Ala Tyr Lys 
355 



<210> SEQ ID 8 
<211> Lenght347 
<212> Type: PRT 

<213> Organism: Pseudomonas fluorescens A2-2 
<400> SEQUENCE 8 

Met Ala Arg Ser Pro Glu Thr Asn Ser Ala Met Pro Gin Gin lie Arg 
1 5 10 15 



Gin Leu L^u Tyr Ser Gin Leu lie Ser Gin Ser He Gin Thr Phe Cys 
20 25 30 
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Glu Leu Arg Leu Pro Asp Val Leu Gin Ala Ala Gly Gin Pro Thr Ser 
35 40 45 



lie Glu Arg Leu Ala Glu Gin Thr His Thr His lie Ser Ala Leu Ser 
50 55 60 



Arg Leu Leu Lys Ala Leu Lys Pro Phe Gly Leu Val Lys Glu Thr Asp 
65 70 75 80 



Glu Gly Phe Ser Leu Thr Asp Leu Gly Ala Ser Leu Thr His Asp Ala 
85 90 95 



Phe Ala Ser Ala Gin Pro Ser Ala Leu Leu lie Asn Gly Glu Met Gly 
100 105 110 



Gin Ala Trp Arg Gly Met Ala Gin Thr lie Arg Thr Gly Glu Ser Ser 
115 120 125 



Phe Lys Met Tyr Tyr Gly lie Ser Leu Phe Glu Tyr Phe Glu Gin His 
130 135 140 



Pro Glu Arg Arg Ala lie Phe Asp Arg Ser Gin Asp Met Gly Leu Asp 
145 150 155 160 



Leu Glu He Pro Glu lie Leu Glu Asn He Asn Leu Asn Asp Gly Glu 
165 170 175 



Asn He Val Asp Val Gly Gly Gly Ser Gly His Leu Leu Met His Met 
180 185 190 



Leu Asp Lys Trp Pro Glu Ser Thr Gly lie Leu Phe Asp Leu Pro Val 
195 200 205 



Ala Ala Lys lie Ala Gin Gin His Leu His Lys Ser Gly Lys Ala Gly 
210 215 220 
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Cys Phe Glu DeVU Ala Gly AspPhe Phe Lys Ser Leu Pro Asp Ser 

Gly Ser Val Tyr Leu Leu Ser His Val Leu His Asp Trp Gly Asp Glu 

250 255 

Asp Cys Lys Ala He Leu Ala Thr Cys Arg Arg Ser Met Pro Asp Asn 

265 270 

Ala Leu Leu Val Val Val Asp Leu Val lie Asp Gin Ser Glu Ser Ala 
Gin Pro Asn Pro Thr Gly Ala Met Met Asp Leu Tyr Met Leu Ser Leu 

^ u -^yo 300 

Phe Gly He Ala Gly Gly Lys Glu Arg Asn Glu Asp Glu Phe Arg Thr 

Leu He Glu Asn Ser Gly Phe Asn Val Lys Gin Val Lys Arg Leu Pro 
c525 330 335 

Ser Gly Asn Gly He He Phe Ala Tyr Pro Lys 
340 345 



<210> SEQID9 
<211> Lenght: 180 
<212> Type: PRT 

< ?o2 > Organism: Pseudomonas fluorescens A2-2 
<400> SEQUENCE: 9 

Met Ser Thr Leu Val Tyr Tyr Val Ala Ala Thr Leu Asp Gly Tyr He 
1 5 io 15 J y 

Ala Thr Gin Gin His Lys Leu Asp Trp Leu Glu Asn Phe Ala Leu Gly 
^ u 25 30 J 
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Asp Asp Ala Thr Ala Tyr Asp Asp Phe Tyr Gin Thr lie Gly Ala Val 
35 40 . 45 



Val Met Gly Ser Gin Thr Tyr Glu Trp lie Met Ser Asn Ala Pro Asp 
50 55 60 



Asp Trp Pro Tyr Gin Asp Val Pro Ala Phe Val Met Ser Asn Arg Asp 
65 70 75 80 



Leu Ser Ala Pro Ala Asn Leu Asp lie Thr Phe Leu Arg Gly Asp Ala 
85 90 95 



Ser Ala He Ala Val Arg Ala Arg Gin Ala Ala Lys Gly Lys Asn Val 
100 105 110 



Trp Leu Val Gly Gly Gly Lys Thr Ala Ala Cys Phe Ala Asn Ala Gly 
115 120 125 



Glu Leu Gin Gin Leu Phe lie Thr Thr lie Pro Thr Phe lie Gly Thr 
130 135 140 



Gly Val Pro Val Leu Pro Val Asp Arg Ala Leu Glu Val Val Leu Arg 
145 150 155 160 



Glu Gin Arg Thr Leu Gin Ser Gly Ala Met Glu Cys He Leu Asp Val 
165 170 175 



Lys Lys Ala Asp 
180 



<210> SEQ ID 10 
<211> Length: 220 
<212> Type: PRT 
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<213> Organism: Pseudomonas fluorescens A2-2 
<400> SEQUENCE: 10 

Met Ser Asn Val Phe Ser Gly Gly Lys Gly Asn Gly Asn Pro Gly Phe 
15 10 15 

Val Arg Thr Phe Ser Arg lie Ala Pro Thr Tyr Glu Glu Lys Tyr Gly 
20 25 30 

Thr Lys Leu Ser Gin Ala His Asp Asp Cys Leu Arg Met Leu Ser Arg 
35 40 45 



Tip Met Cys Thr Ser Arg Pro Glu Arg Val Leu Asp lie Gly Cys Gly 
50 55 50 

Thr Gly Ala Leu lie Glu Arg Met Phe Ala Leu Trp Pro Glu Ala Arg 
65 70 75 80 

Phe Glu Gly Val Asp Pro Ala Gin Gly Met Val Asp Glu Ala Ala Lys 
85 90 95 

Arg Arg Pro Phe Ala Ser Phe Val Lys Gly Val Ala Glu Ala Leu Pro 
' 100 105 HO 

Phe Pro Ser Gin Ser Met Asp Leu Val Val Cys Ser Met Ser Phe Gly 
115 120 125 

His Trp Ala Asp Lys Ser Val Ser Leu Asn Glu Val Arg Arg Val Leu 
130 135 140 w to 

Lys Pro Gin Gly Leu Phe Cys Leu Val Glu Asn Leu Pro Ala Gly Trp 
145 150 155 160 



Gly Leu Thr Thr Leu He Asn Trp Leu Leu Gly Ser Leu Ala 
165 170 175 



Arg Ser Glu His Glu Val lie Gin Leu Ala Gin Thr Ala Gly Leu Gin 
180 185 190 



Ser Met Glu Thr Ser Val Thr Asp Gin His Val lie Val Ala Thr Phe 
195 200 205 



Arg Pro Cys Cys Gly Glu Val Gly Asp His Gly Arg 
210 215 220 



<210> SEQID 11 
<211> Length: 509 
<212> Type: PRT 

<213> Organism: Pseudomonas fluorescens A2-2 
<400> SEQUENCE: 11 

Met Val Val Lys Asn Lys Gin Val Leu Val Val Gly Ala Gly Pro Val 
15 10 15 



Gly Leu Ala Val Ala Ala Ala Leu Ala Glu Leu Gly lie Ala Val Asp 
20 25 30 



Leu He Asp Lys Arg Pro Ala Ala Ser Pro His Ser Arg Ala Phe Gly 
35 40 45 



Leu Glu Pro Val Thr Leu Glu Leu Leu Asn Ala Trp Gly Val Ala Asp 
50 55 60 



Glu Met lie Arg Arg Gly He Val Trp Ala Ser Ala Pro Leu Gly Asp 
65 70 75 80 



Lys Ala Gly Arg Thr Leu Ser Phe Ser Lys Leu Pro Cys Glu Tyr Pro 
85 90 95 



His Met Val lie lie Pro Gin Ser Gin Thr Glu Ser Val Leu Thr Asp 
100 105 110 
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Trp Val Asn Arg Lys Gly Val Asn Leu Lys Arg Gly Tyr Ala Leu Lys 
115 120 125 



Ala Leu Asp Ala Gly Asp Leu His Val Glu Val Thr Leu Glu His Ser 
130 135 140 



Glu Thr Gly Ser Val Gin Gin Ser Arg Tyr Asp Trp Val Leu Gly Ala 

ITU * Cr\ i r— r— " - 



150 



155 



160 



Asp Gly Val Asn Ser Ser Val Arg Gin Leu Leu Asn lie Ser Phe Val 
165 170 175 



Gly Gin Asp Tyr Lys His Ser Leu Val Val Ala Asp Val Val Leu Arg 



185 



190 



Asn Pro Pro Ser Pro Ala Val His Ala Arg Ser Val Ser Arg Gly Leu 
195 200 205 



Val Ala Leu Phe Pro Leu Pro Asp Gly Ser Tyr Arg Val Ser He Glu 
210 215 220 



Asp Asn Glu Arg Met Asp Thr Pro Val Lys Gin Pro Val Thr His Glu 
225 230 235 240 



Glu lie Ala Gly Gly Met Lys Asp He Leu Gly Thr Asp Phe Gly Leu 
245 250 255 



Ala Gin Val Leu Trp Ser Ala Arg Tyr Arg Ser Gin Gin Arg Leu Ala 
260 265 270 



His Tyr Arg Gin Gly Arg Val Phe Leu Leu Gly Asp Ala Ala His 
275 280 285 



Thr His Val Pro Ala Gly Gly Gin Gly Leu Gin Met Gly He Gly Asp 
290 295 300 



Ala Ala Asn Leu Ala Tip Lys Leu Ala Gly Val lie Gin Ala Thr Leu 
305 310 315 320 



Pro Met Asp Leu Leu Glu Ser Tyr Glu Ala Glu Arg Arg Pro He Ala 
325 330 335 



Ala Ala Ala Leu Arg Asn Thr Asp Leu Leu Phe Arg Phe Asn Thr Ala 
340 345 350 



Ser Gly Pro He Gly Arg Leu He His Trp lie Gly Leu Gin Ala Thr 
355 360 365 



Arg Ala Pro Tyr Val Ala Gin Lys Val Val Ser Ala Leu Ala Gly Glu 
370 375 380 



Gly Val Arg Tyr Asp Ser Val Arg Arg Arg Gly Asp His Arg Leu Val 
385 390 395 400 



Gly Arg Arg Leu Pro Leu Leu Ser Leu Leu Pro Glu Gly Glu Arg Leu 
405 410 415 



Pro Arg Gin Ser Leu Thr Gin Leu Leu Arg Ala Gly Arg Phe Val Leu 
420 425 430 



Val His His Arg Ala Lys Ala Leu Ala Ala Asp Leu Arg Arg Asp Phe 
435 440 445 



Pro Gly Leu Gin Thr Ala Ser lie Cys Glu Asp Ser His Asn Asn Ser 
450 455 460 



Leu Ser Ala Gly Glu Gly Val He Val Arg Pro Asp Gly Val Val lie 
465 470 475 480 
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Trp Val Gly Lys Lys Ser Thr Leu Ala Lys Glu Are Leu Gly Glu Tro 
485 490 495 



Leu Leu Asp Asp Ser Lys Ser Ala Arg Gin Ser Leu Thr 
500 505 



<210> SEQ ID 12 
<211> LENGHT: 348 
<212> TYPE: PRT 

<213> Organism: Pseudomonas fluorescens A2-2 
<400> SEQUENCE: 12 

Met Ala His Tyr Asp Ser Val Gly Thr Ala Pro Gly Ala Ser Asp Asp 
15 10 15 



Gly Met Ala Val Ala Ser lie Leu Gin Leu Met Arg Glu Thr lie Thr 
20 25 30 



Arg Ser Asp Ala Lys Asn Asn Val Val Phe Leu Leu Ala Asp Gly Glu 
35 40 45 



Glu Leu Gly Leu Leu Gly Ala Glu His Tyr Val Ser Gin Leu Ser Thr 
50 55 60 



Pro Glu Arg Glu Ala lie Arg Leu Val Leu Asn Phe Glu Ala Arg Gly 
65 70 75 80 

Asn Gin Gly lie Pro Leu Leu Phe Glu Thr Ser Gin Lys Asp Tyr Ala 
85 90 95 



Leu He Arg Thr Val Asn Ala Gly Val Arg Asp lie He Ser Phe Ser 
100 105 HQ 



Thr Pro Leu He Tyr Asn Met Leu Gin Asn Asp Thr Asp Phe Thr 



115 



120 



125 



Val Phe Arg Lys Lys Asn He Ala Gly Leu Asn Phe Ala Val Val Glu 
130 135 140 



Gly Phe Gin His Tyr His His Met Ser Asp Thr Val Glu Asn Leu Gly 
145 150 155 * 160 



Pro Glu Thr Leu Phe Arg Tyr Gin Lys Thr Val Arg Glu Val Gly Asn 
165 170 175 



His Phe lie Gin Gly He Asp Leu Ser Ser Leu Ser Ala Asp Glu Asp 
180 185 190 



Ala Thr Tyr Phe Pro Leu Pro Gly Gly Thr Leu Leu Val Leu Asn Leu 
195 200 205 



Pro Thr Leu Tyr Ala Leu Gly Met Gly Ser Phe Val Leu Cys Gly Leu 
210 215 220 



Trp Ala Gin Arg Cys Arg Thr Arg Arg Gin His Gin Gly Lys Asn Cys 
225 230 235 240 



Val Leu Arg Pro Met Ala He Ala Leu Leu Gly He Ala Cys Ala Ala 
245 250 255 



Leu Val Phe Tyr Val Pro Ser He Ala Tyr Leu Phe Val lie Pro Ser 
260 265 270 



Leu Leu Leu Ala Cys Ala Met Leu Ser Arg Ser Leu Phe He Ser Tyr 
275 280 285 ' 



Ser He Met Leu Leu Gly Ala Tyr Ala Cys Gly He Leu Tyr Ala Pro 
290 295 300 
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IleVal Tyr Leu lie Ser Ser Gly Leu Lys Met Pro Phe lie Ala Gly 
305 310 315 320 

Val He Ala Leu Leu Pro Leu Cys Leu Leu Ala Val Gly Leu Ala Glv 
325 330 335 



Val He Ala Arg Ser Arg Asp Cys Arg Thr Cys Asp 
340 345 



<210> SEQ ID 13 
<211> Lenght: 572 
<212> Type: PRT 

<213> Organism: Pseudomonas fluorescens A2-2 
<400> SEQUENCE: 13 

Met Arg Ser Leu Lys He lie Val Leu Ala Ser Ala Phe Asn Gly Leu 
1 5 10 15 



Thr Gin Arg Ala Trp Leu Asp Leu Arg Gin Ser Gly His Ala Pro Ser 
20 25 30 



Val Val Leu Phe Thr Asp Pro Ala Leu Val Cys Gin Gin He Glu Asp 
35 40 45 



Ser Asp Ala Asp Leu Val He Cys Pro Phe Leu Lys Asp Arg Val Pro 
50 55 60 



Gin Gin Leu Trp Ser Asn Leu Glu Arg Pro Val Val He He His Pro 
65 70 75 80 



Gly ne Val Gly Asp Arg Gly Ala Ser Ala Leu Asp Trp Ala He Ser 
85 90 95 



Gin Gin Val Gly Arg Trp Gly Val Thr Ala Leu Gin Ala Val Glu Glu 
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100 



105 



110 



Met Asp Ala Gly Pro lie Trp Ser Thr Cys Glu Phe Asp Met Pro Ala 
115 120 125 



Asp Val Arg Lys Ser Glu Leu Tyr Asn Gly Ala Val Ser Asp Ala Ala 
130 135 140 



Leu Tyr Cys lie Arg Asp Val Val Glu Lys Phe Ala Arg Val Phe Val 
145 150 155 160 



Pro Val Pro Leu Asp Tyr Thr Gin Ala His Val lie Gly Arg Leu Gin 
165 170 175 



Pro Asn Met Thr Gin Ala Asp Arg Thr Phe Ser Trp Tyr Asp Cys Ala 
180 185 190 



Arg Phe He Lys Arg Cys lie Asp Ala Ala Asp Gly Gin Pro Gly Val 
195 200 205 



Leu Ala Ser He Gin Gly Gly Gin Tyr Tyr Leu Tyr Asp Ala His Leu 
210 215 220 



Asp Ala Arg His Gly Thr Pro Gly Glu lie Leu Ala Val Gin Asp Asp 
225 230 235 240 



Ala Val Leu Val Ala Ala Gly Asp Gin Ser Leu Trp He Gly Ser Leu 
245 250 255 



Lys Arg Lys Ala Arg Pro Gly Glu Glu Thr Phe Lys Leu Pro Ala Arg 
260 265 270 



His Val Leu Ala Glu Ala Leu Ala Asp lie Pro Val Leu Asp Ser Ser 
275 280 285 



lie Ala Asn Gin Met Phe Asp Glu Gin Ala Tyr Gin Pro He Arg Tyr 
290 295 300 y 



Arg Glu Ala Gly His Val Gly Glu Leu Thr Phe Glu Phe Tyr Asn Gly 
305 310 315 320 

Ala Met Ser Thr Glu Gin Cys Gin Arg Leu Val Ala Ala Leu Arg Trp 
325 330 335 

Ala Lys Thr Arg Asp Thr Gin Val Leu Val He Lys Gly Gly Arg Glv 
340 345 350 g Y 

Ser Phe S er Asn Gly Val His Leu Asn Val lie Gin Ala Ala Pro Val 
35 5 360 365 

Pro Gly Leu Glu Ala Trp Ala Asn lie Gin Ala He Tyr Asp Val Cvs 
370 375 380 

His Glu Leu Leu Thr Ala Arg Gin Leu Val He Ser Gly Leu Thr Gly 
385 390 395 400 

Ser Ala Gly Ala Gly Gly Val Met Leu Ala Leu Ala Ala Asp lie Val 
405 410 415 

Leu Ma %% Glu Ser v al Val Leu Asn Pro His Tyr Lys Thr Met Gly 
420 425 430 " " 

Leu Tyr Gly S er Glu Tyr Trp Thr Tyr Ser Leu Pro Arg Ala Val Gly 
435 440 445 3 

Ser Glu Val Ala His Gin Leu Thr Asp Ala Cys Leu Pro He Ser Ala 
450 455 460 
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Gin Ala Glu Gin Tyr Gly Leu Val Gin Gly He Gly Pro Arg Cy; 



119 

465 470 475 480 



Pro His Ala Phe Ser Arg Trp Leu Met Gin Gin Ala Ser Ser Ala Leu 
485 490 495 



Thr Asp Glu Lys Tyr Ala Val Ala Arg Ala Arg Lys Ala Ala Leu Asp 
500 505 510 



lie Asp Gin lie Thr Arg Cys Arg Glu Ala Glu Leu Ala Gin Met Gin 
515 520 525 



Leu Asp Met Val His Asn Arg His Gin Phe Ala Glu Lys Cys Arg Asn 
530 535 540 



Phe Val Leu Lys Arg Lys Thr Cys Gin Thr Pro Gin Arg Leu Met Ala 
545 550 555 560 



Pro Trp Ala Val Ala Arg Glu Ala Ala Leu Val Gly 
565 570 



<210> SEQ ID 14 
<211> Lenght: 230 
<212> Type: PRT 

<213> Organism: Pseudomonas fluorescens A2-2 
<400> SEQUENCE: 14 

Met lie Gly He Val He Pro Ala His Asn Glu Glu Arg His lie Ser 
15 10 15 



Ala Cys Leu Ala Ser He Gin Arg Ala lie Ala His Pro Ala Leu Ala 
20 25 30 



His Gin Gin Val Gin Leu Leu Val Val Leu Asp Ala Cys Ser Asp Glu 
35 40 45 



# 



Thr Ala Thr Arg Val Ser Ala Met Gly Val Ala Thr Leu Glu Val Ser 



55 



60 



Val Arg Asn Val Gly Lys Ala Arg Ala Leu Gly Ala Glu Arg Leu Leu 



70 



75 



80 



Glu Val Gly Ala Gin Tip Leu Ala Phe ThrAsp Ala Asp Thr Val Val 



90 



95 



Pro Ala Asp Trp Leu Val Arg Gin lie Gly Phe Gly Ala Asp Ala Val 



105 



110 



Cys Gly Thr Val Glu Val Asp Ser Trp Ser Glu Tyr Gly Glu Ser Val 
AJ-is 120 ioc: 



125 



Arg Ser Arg Tyr Leu Glu Leu Tyr Gin Phe Thr Glu Asn His Arg His 



135 



140 



lie His Gly Ala Asn Leu Gly LeuSer Ala Asp Ala Tyr Arg Asn Ala 



150 



155 



160 



Gly Gly Phe Gin His Leu Val Ala His Glu Asp Val Gin Leu Val Ala 
ioS 170 175 



Asp Leu Glu Arg lie Gly Ala Arg He Val Trp Thr Ala Thr Asn Pro 
180 185 190 



Val ValThr Ser Ala Arg Arg Asp TyrJLys Cys Arg Gly Gly Phe Gly 



200 



205 



Glu Tyr Leu Ala Ser Leu Val Ala Glu Gly Thr Arg Glu His Ser Pro 



215 



220 



Ala His Ala Pro He Gly 
225 230 



<210> SEQ ID 15 
<211> Lenght: 348 
<212> Type. PRT 

<213> Organism: Pseudomonas fluorescens A2-2 
<400> SEQUENCE: 15 

Met His Pro His Lys Thr Ala He Val Leu He Glu Tyr Gin Asn Asp 
1 5 10 15 



Phe Thr Thr Pro Gly Gly Val Phe His Asp Ala Val Lys Asp Val Met 
20 25 30 



Gin Thr Ser Asn Met Leu Ala Asn Thr Ala Thr Thr He Glu Gin Ala 
35 40 45 



Arg Lys Leu Gly Val Lys He He His Leu Pro He Arg Phe Ala Asp 
50 55 60 



Gly Tyr Pro Glu Leu Thr Leu Arg Ser Tyr Gly He Leu Lys Gly Val 
65 70 75 80 



Ala Asp Gly Ser Ala Phe Arg Ala Gly Ser Trp Gly Ala Glu lie Thr 
85 90 95 



Asp Ala Leu Lys Arg Asp Pro Thr Asp He Val lie Glu Gly Lys Arg 
100 105 110 



Gly Leu Asp Ala Phe Ala Thr Thr Gly Leu Asp Leu Val Leu Arg Asn 
115 120 125 



Asn Gly lie Gin Asn Leu Val Val Ala Gly Phe Leu Thr Asn Cys Cys 
130 135 140 



Val Glu Gly Thr Val Arg Ser Gly Tyr Glu Lys Gly Tyr Asp Val Val 
145 150 155 160 



Thr Leu Thr Asp Cys Thr Ala Thr Phe Ser Asp Glu Gin Gin Are Ala 
165 170 175 



Ala Glu Gin Phe Thr Leu Pro Met Phe Phe Ala Asn Pro Ala Thr His 
180 185 190 



Arg Val Ser Ala Ser Thr Glu Arg Arg He Lys Lys Ala Ala Thr Pro 
195 200 205 
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Ala Glu Ser Pro Leu Phe Cys Leu Gly His Ser Val Gly Ala Tyr Cvs 

210 OIC r^^-x^ J J 



215 



220 



lie Ser Pro Phe Pro Asn Asp Gin Ser Ser Arg Phe Thr Ser Thr Are 
225 <->or» "~ --- => 



230 



235 



240 



Leu lie His Thr Ser Ser Leu Arg Ser Pro Val Leu Ala Tip Met Pro 
245 250 255 



Ser Ala Met Asn Leu Lys Ala Phe Phe Thr Ser Met Leu Arg Pro Ala 
260 265 270 



Phe His Val Thr Trp He Asn Thr lie Leu Gly Val Val Thr Pro Are 
275 280 285 



Tyr Pro Ala Ala Gly Thr Ser Ser Ser Leu Ala Trp Arg Leu Met He 
290 295 300 



Trp Asn Leu Ser Cys Ser Gly Thr Leu Ala Thr Leu Val lie Ala Ala 
305 310 315 320 



Tyr Thr Thr Ser Pro Met Ala Val Ala Val Ser Val Glu Val Ser Ala 
325 330 335 



Ala Arg Ser lie Arg Thr Lys Gly Met Asp Lys Ser 
340 345 
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